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BIOMETRIKA. 


ON THE PROBABLE ERRORS OF FREQUENCY 
CONSTANTS. 


PART ILI. 


EDITORIAL. 


(1) THE probable errors of frequency constants for a single variable have been 
discussed in an elementary manner in Vol. I. pp. 273—281 of this Journal. The 
following notes on the probable errors of frequency constants of distributions with 
two variables have been provided at the request of certain of our readers*, 


All the constants of a frequency distribution of two variates are expressible in 
terms of the higher product moments defined by 
Gis ae Dsl Ya Oh NN maa tiie oh ott sasha sedans seh (1), 


where the origin is at fived values of « and y. ‘Transferred to the mean @, 7, we 
have: 


De nan a =) Yap —— G24 ING « cotaedeosteame 545.00 wien (ii), 


where # and 7 will vary from sample to sample. Here ny is the frequency of 
individuals with characters 2, and yy, and WN is the total population. 


Throughout 74g, will denote the correlation between two quantities a and 8; 
o., @p Will denote the standard deviations of a and £. 


The following results are well known: 


é Nsgs! Siete! 
On. = New (1 = *) eseatute nemae dinstgeaee oxaiead (111), 
Oo, O77 Ss Ns (iv) 
ie ong nny 7 Ra ; 


* Reproduced from Lecture Notes. 
Biometrika 1x 1 
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Further if n,=S(nsy) for all values of s’, 


Ns Ny 
Cy ts Ny Ng ie aes Ne shale fe JeteKolstetanars) stete/overstetatelReRstatetels BAGH) 
Ng Nes! 
nF yy ngtyy = Petree Re fob (vi), 
Ns a 
on 1g 7 0g Se Ng Ng! = Ney’ (1 = *] aca)aj sia: ais iets lafetotetelotet safer steve ..(Vil). 


See Biometrika, Vol. Vv. pp. 191—2. 
(2) We have, if 6 denote a variation in any frequency constant due to random 


sampling, 


Oy Opes (CV MALO! Ganpnceptiaccoss Sere (vil) bis, 


and if m be number of random samples: 


NACE ee eK 899,q')° 
=8 (77, aie Ys ’) ot 28 (c, Try ng of Nay! x VLA Y st Yet ) 
Neg Ng pr 
. ¢ SS Fett . he H 
= Sls (.-' ny ve) aoe Ty" “gy _ OS 25 W = nstndlys ye) 


= N (pxq,27 — Pa Pad): 


Thus Ong = Pet E Dad eee ss coat oe (QUALI): 
= Q 7 
Again NSpy w= S (ON le Ye" ); 
J2 na p Itt may Yq, ai 2 1 gt, q+’ 
N°c 7 pa, q on wu Tne, UPuw =8 (oy Trg! ‘Mss! Ney x os HYs ; It ) +8 (o Mg U8 Ys Z ) 
= N (pene — Pa,qPuw')s 
— Potu,d+u — Pag Puw : 
thus Gin Ob age Wott eeseeaeeeneet (axa): 


(vil) and (ix) refer only to the higher moment coefficients about a fiwed origin. | 
About the mean we have | | 
NBo,q = {tse (@s — %)" (Ys — Y)"}, 
NOPio,q = 8 [8nss (Ws — B)* (yy — Y)"} — GOLPqa,g — VOY Paga +++ (x). 
Now it is clear that before going further we want to know the correlation 
between variations in 7, and & or y. Now 
Ne=S (ma), 
NOz = S (bn), 
N6HSNgy = S (Ong dN Lt), 


Ng oy (NEN gs! 
Nez? Engg! = Ngo! (1 = a) & -—S ( WV vt 


=Nsy (#;— ), using (vi) and (vii), 


or Onn Van, = ay ee), aes niece pls Sie «eee (xi). 
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Next we want ozo 773,, but this is known, or easily found to be p,/N. 


Returning to (x), squaring, summing for every possible sample and dividing by 
the number of samples, we have 


D. m2 mn. .72 (2 tea? ls nine ait 
N*o? Dag = Bea — Prag + FP 2,0Pa—1,9 + 1 Po,2P 9-1 + 299 Pi Pag X Pada 


— 29D q41,0 Paya — 29 Pag Pog +(x). 
We may now find the correlation between any pair of higher moment 
coefficients, 


N8pu,w =8 {ONgy (as — @)” (Ye — "| — USE Bua — WO Du,w-a «+-(xili). 
Multiplying (x) and (xiii) together, summing for all possible samples and 
dividing by the number of samples and N, we have 


NOG TB ue Bacg Paw = Dating — Bag Pusu + Ps, 0Pq—1,q Pua.u + TW’ Po,-Pq,q a Puw-r 
+ UD, :Py—,qPuw—a + (UP s,1D 9g Pua, 
— Upon, ¢ Pua — UP, ¢ 4 Puwa 
SO Dine ae Dg aig! Gi Dual tay, G1 leven bat tian ses wenses (xiv). 
(xii) and (xiv) contain all the requisite data for the probable errors of random 
sampling in the case of two variables. They, of course, contain implicitly the case 
of one variable, as we have only to put q’ and w’ zero in order to fall back upon the 
formulae (vii) and (viii) of the first part of this paper (Biometrika, Vol. 
pp. 276—7). 
(3) We may illustrate as follows: 
(a) Correlation of errors in means: 
ay Maye uree sie Rncesenerntveuesure ows pias xy)): 
(8) Correlation of errors in standard deviations* : 


Noz Px aay! "D Px Poo = Px — Px» Pres 


40x yTg O 


r = 
Oy TyFy N , 


Pe» = P» Pos 
VB —B 259 Bon NV Bins — —Pn 
This is the general value of the correlation between the standard deviations of two 
correlated variables. We may write it in the form 


r = Pxs/ (Pw Po) — 1 
‘atv (B.—=1) (Ar = 1) 


where 8, and £,' are the second #’s for the two variables respectively 
We may now investigate Px», 


ey re cr ere (xvi). 


, 
CeO ne 


ee il ath — 
Pa = 7 S {(@s — &)? (yy — 7) Nes} 


* We may safely write poo for po,2 etc. when we leave the general formulae, but pyq_; for pq, q’1 iS 
capable of misinterpretation. 
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Let us sum for s’ and keep a, constant ; then 


= {Cys — YP Nev} = {Ms (e+ (Ys—Y))}, 
where o,, is the standard deviation and 7, the mean of the array of y’s for a given 
a. If the regression be linear and homoscedastic, then 


= ee o Es 
Ys —Y = Vay = (@;—&), 
Ox 
and oy = oy (1-1 ny). 


oe ee ee 
Hence pa aa |v. (a; — @)? (1 — ray) + Ms (a5 — @)4 va 
N Ge 


= 67 Oy {1 — May + xy Bah, 


Pode qth oy (Pi oal 


Similarly Pe» —1 =? y, (8, — 1), 
P20 Poe : 


on the assumption that the other regression is also linear and homoscedastiec. 


It is accordingly impossible for two variables to have a regression linear and 
homoscedastic in both senses unless 8, has the same value for both variables. 
Clearly for most practical purposes we may take 


Dee 72 AN (Oe 
Pon Dor TP ny N(B2— 1) (Bs —1) 


Thus approximately Te (xvii). 


This is identical with the result found by Pearson and Filon (Phil. Trans. 
Vol. 191, A, p. 242) on the assumption of normal correlation. It is now seen to 
be true, whenever we may assert linearity of regression and homoscedastic dis- 
tribution for both variables. 


(y) Probable error of a coefficient of correlation. 
Put Tey = rae 
20 Por 
or ey = opu _1epo 1 Ope 
Vay Pu 2 Px 2 Poe 
Square, add for all random samples and divide by their number: 


ray 1 (pe— Pr? , 1 Pwo - Pw , 1 Pu- Po , 1 Po — PoPee 
Poy IN ? aa me 4. - a By 4 ra a 2 : ave 
ease Ee - Es BaPa ica 
PuPro Pup 
This is the most general value of the standard deviation a, of a correlation 
coefficient. It was first given by Sheppard (Phil. Trans. Vol. 192, A, p. 128, with 
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an obvious printer’s slip, the omission of 7?,,). It is clear that we require a know- 
ledge of at least an approximate value of p,, and p,; as well as p.. in order to 
simplify this expression, which is far too cumbersome for practical use. 


Now NV X Bn =S8 {N59 (@s — 2 (yy — Y)} 


= 8 {ns (Ys — Y) (as — XI, 
if we sum for the sth array of 7s. 


But, if the regression be linear, 
— Vay Fy = 
7 = 22" (w, — 2), 
rut Of 4) 
therefore N X fn = S {ng (as — %)4}, 


Pa = ley Ty De 


Dig Pani oie eee ceis'oe shee oe ead aemeee rade tee es (xix): 
Similarly Prs = Tay Fx Cies, 
ca Th Uh) OF) CR CRORE ee a ee ny (xx). 
We can now substitute in (xviil), if we determine what value to give to p». If 
we take 
Pe = CxO,’ lis Pay + Pay X $ (Bo ar BD}; 
we have Pa Sl Breet at f 
P20 Poe 2 
and Px =— 
Pr 
or bs -l= ee ar 3 (B; —1+ By —1). 
11 ay 
Hence 
ee eee 18) 1 ey 1) ) 
LINE Pay a ar a: Y( +47 ay 
ee 
ay i pie! : sce A = = —7 
We - H@- 14+ 8/-D0-r wh 
1- Mey (4 
ae —2(8,-—1'4+ 8, —1)r*,,} 
_ Tey Py 1 i 9 
= ei ly — 4 (Bi — 3 + Bs — 3) Try}. 
Or Petey age 8 6? — 3) | re (xxi). 


Vey /N l le Pony) 
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This result is of much interest. If the kurtosis be zero, then 


and we have EEN efqoS00ooads (xxil) 


the value originally given by Pearson and Filon for the standard deviation of 
a correlation coefficient when the frequency surface is Gaussian (Phil. Trans. 
Vol. 191, A, p. 242). We see accordingly that: 


(i) Equal kurtosis is needful in the two variates if the regression is to be 
linear and the arrays to be homoscedastic in the case of each variable. 


(ii) The ordinary value subject to (i) is only correct provided the kurtosis 
is zero, and this is true whether the distribution be Gaussian or not. 


(iii) The ordinary formula may give very inaccurate results, if the kurtosis be 
considerable and the correlation high. 


(iv) It is probable that (xxi), as we have taken a mean value for p», gives 
fairly good results even when the correlation is not linear, 


Clearly we must always have 
2 
— 
VB.—1+ By —1 
Or, for linear regression in homoscedastic systems there is a superior limit to the 
correlation possible with given values of the kurtosis. This is an interesting point, 


and forms a remarkable limitation on the nature of double linear homoscedastic 
regression. 


Toy 


(4) We may now find the correlation between a product-moment p,,q about 
fixed axes and Dy,w a product-moment about axes through the centroid. We have 
to multiply together (vii) bes and (xin). We have 


Ngs' 


2 = ee a ot | L — =) wf ayt x (a, — £2)" (yy — G)™ 
NV ODaa’ She w ‘ V Bact By, ul a | ( va) eras (a ) (Ys I) 


C 


Nit! Nss! ; is er 
-S ey Salat X (as — 2!" (yy — ay 


Ngg! 


— Upua,w 8 " (@, — &) £2 Yy% \ 


—WPuwiaS y (yy —Y) vs! yet nee (xxiv). 


We can simplify the form of (xxiv) by taking the fixed axes now through the 
centroid itself. This gives us 


1 
Se a pe Be LD ee De 
Oy. TDicu' Pay Pun = Darwuate ~ Dag Pusu — UPua,w Pory¢ — UPuwaPaas} 
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Illustrations. (a) To find the correlation between a deviation due to random 
sampling in a mean and one in the standard deviation of the same variate. 


Take g=1, 9 =0, +. Pog =e; U=2, Uw =0,-.. Daw = pe- 


FEF.) Tuy = NV ( Pso faa Pio Pr a 2D iP). 


But Pw = 0; Ore o,/VN, 
= = Vy — p pe/VN = Be VB. - = -1/VN, Po = Hs = VB, Ox. 
Hence ep Any aes EIN ON Orsi: Atnam: sancttinad yenatess (xxv). 


This is perfectly general; we see that variations in the mean are independent of 
variations in the variability for all symmetrical systems including the Gaussian. 


(8) To find the correlation between a deviation due to random sampling in 
the mean of one variate and one in the standard deviation of a correlated variate. 


iskemg— 1, q—0, 9 .:: Pag = i = Nt = 2s Dua — Doe pn = Oy 
irs Fy,,1 "Eby FIN Ge PwPo2 7? 2D Pn) 
But Bo=Pu=0, oz=0,/VN, o,., = fe VB/ —1/VN. 
It remains to consider 


a 1 
Pu=W S {ns (ws —@) (ye — ¥)"} 
1 


= 7 5 {its (He — %) (ys — 9"), 


where %y= mean of array of a’s corresponding to the yy of y. Hence if the 
regression be linear 


Pu yy 2072 8 tne ~ DP 
= * fis = Tay Fa oy? V By. 
Thus we have "ie, = Tiny = Tey V BY OR Ol ite eaeratuat tence tere (xxvii). 
Similarly Tg tag N Bi) NB be snatters-neenntinakere (xxviii). 
Clearly Teo CU ais lag Ogee 
and, legge al Vera oes Tye ae 


by (xv), which show us that these correlations are second order correlations, and 
proves that the correlation of the mean of one variate with the variability of 
a second is zero, for constant mean value of the second, since 


8 On the Probable Errors of Frequency Constants 
(y) To find the correlation between a mean and a coefficient of correlation, 
Le. between @ and rz,. 
We have to multiply 6% = 6p, with 
Olay = Spy ae 1 Spey “cal il Spor 
Pay Pu 2 Px 2 Poe ; 
and apply (xxv) to each term. We have 


1 {Pe wig! Pel 


gee | Op Or Ii ti = oc= = 
Tey B VyyS NT Pu - 2 Da 2 Dos 


Fey 
El fimsusev®, LV Bion? 1 Py oxoy2V By) 
= es —, eee 


N 


Vay Fy Ox 2 ox a ul 
1 oy Q’ 
= Woelk (VB: = Pay VBy)}; 


using the values of p., and p,» for linear regression. If we now use the value 
in (xxi) for o,, we have 
ay 


aT ay {VB — Vay v By } 


Vy yE oe ee te (xx1x); 
Ce) lee (snt cab yeh = 8) oe 
\ 1— Pay 
reducing when the kurtosis of both distributions is zero to 
a 1 rey {V Bi — Tey vBy} (xxx) 
aes) i-3, ee ; 


and vanishing for all symmetrical linearly correlated variates, including of course 
Gaussian systems. 


(6) To find the correlation between a deviation in a standard deviation and 
one in a coefficient of correlation. We have to multiply df.) by 
& icy opu i SDoy es 1 ODoo 
Pay Pu 2 Po 2 Doz 


We find 
I _ Pa—PmPu 1 Bo— Px’ 


1 
Gna = et Ta 
Tena Tey Pol ay N' Pu 2 N Px» 2 Ni Poe 


Hence, assuming linearity of regression, we may put 
Pa Pu aa ee Eby Pu Po = Boo oe 
and approximately by the result immediately above Equation (xvii) 


Pro[ Dor — Px = Fx Vxy V(B2 ll) (ery ak) 


Thus 
1 o2VB.—1 Lee A MN a 
Tay VN Ory" Hany {o°(B2— 1) — $o3? (B2— 1) — $00? 1 ny V(Be — 1)(B,’— 1)}. 


Ui 7) 9 Tee 
Or etl poe ane {Vv B2— 1-1 gy V Bs = di; 
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and by (xxi) ee ieeeaa 
2—-1l—TryV Bo —1 


(1— ry) 41 — 3 (@,—3 4 BY — 3) 
tT 


\ 1-1, 


Ut po ee ay sone XXX) 


Since homoscedastic linear regression supposes 8,= £8.’ this result must be very 
close to 


= dry VB:— i/(1 —1(8,-34 B, —3) i ve) re O3.6.60)) 
ay 


for such distributions. 


Onl ay 


For distributions in which the kurtosis is zero (8,=3) we reach 


eae ie NID sr aisles Crete gees yt (xxxlil), 


a xy 
a result already reached for Gaussian distributions by Pearson and Filon in 1897 *. 
It is now shown to be true for all homoscedastic, mesokurtic systems with linear 
regression. 


(5) Two further probable errors are of interest. If we write ¥,=axv+b for 
the regression line, what are the probable errors of 


@= fey Fy{On and b= FY = TyycyX/oz? 


It will be sufficient to give the values of o and o, when the frequencies are 
symmetrical and the regression linear. In this case 


Uae zi "Fo = "roy = "Fon - 0, "ay = Try? 
0 : ss 
Vey Ss Tay? y Vey Fa y MoyFy /2 Tay, 9 Ty Ty Way: 
Writing 7,=2%/oz, we have 
O0 960, Oley Oo 
— + ae 
Geet Ta, Oy 
and 8b = 8Y — 06% — Ol gy Cy Tx — Vry Tx Oy + AT OCz. 
Whence proceeding in the usual manner we deduce 
Bagel OO ere Bay TRA . 
Cia ee V1— Pay [WN Bee RMS os toma eihaeae PAXKRIV AG 
Op = Oy Ale ray V1 + T2/VN = Oy NEEA Oy ceceeeees (XxXxV). 


These enable us to determine any significant difference between two regression 
lines. 


We can go a stage further and ask what is the probable error of the mean of an 
array, Yz, as found from the regression line. We have 
SY, = xda + Ob. 


* Phil. Trans. Vol. 191, A, p. 242. 
+ Cf. Phil. Trans. Vol. 191, A, p. 245. 
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We require accordingly cacpas. Multiply the above expressions for 6a and 6b 
together, summing and dividing by the number of samples we find 


Tz y- , : 
CaF abd = — ae (= Py) cence se econ XXXVI). 
aSbla No» ( a ( ) 
V1 —9° (a —@)?)2 
Whence we deduce Go ee ! + - } vais divans eee XXXVil). 
Ya J N ox ( ) 


We note the increase of inaccuracy of the means of arrays far from the mean of the 
whole population *. 


The above equations embrace the chief results for the probable errors and 
errors’ correlations of systems of two variables. They have been reached inde- 
pendently of any system of Gaussian distribution. 


* Results (xxxv), (xxxvi) and (xxxvii) are published here for the first time. 


THE RELATIONSHIP BETWEEN THE WEIGHT OF THE 
“SEED PLANTED AND THE CHARACTERISTICS OF 
THE PLANT PRODUCED. I*. 


By J. ARTHUR HARRIS, Ph.D., Carnegie Institution of Washington. 


I. IntTRopuctoRY REMARKS. 


IN practical agriculture the quality of the seed planted is universally recognized 
as of fundamental importance. Three requisites are essential: freedom from 
noxious impurities, purity of breed and viability. All three of these points have 
attracted close attention, and have a voluminous literature. But given seeds 
belonging to the required variety, free from undesirable impurities and germinating 
successfully, it seems of theoretical interest at least, and perhaps of much practical 
importance as well, to ascertain the degree of relationship between the size of the 
seed planted and the characteristics of the individual into which it develops. 


This question although by no means so extensively discussed as the others has 
received considerable attention. Much has been written concerning the desirability 
of winnowing seed to remove the lighter grains. A review of the literature of 
this field would lead us too far from our immediate purpose, which is to present 
the data derived from some rather extensive quantitative investigations. 


It will not, however, be unprofitable to call attention to certain general 
deficiencies of the previous work, especially since this will define the point of view 
directing the studies described here. 


(1) In many cases the distinction between perfectly matured but small seeds 
and potentially large but immature, blighted or shrivelled seeds has been dis- 
regarded. 


(2) The method of grading the seed has, generally speaking, been neither 
uniform nor logical. Usually the separation has been only into “heavy” and 


* This first paper is limited to the presentation of the data for number of pods per plant in twenty 
series of garden beans belonging to three different varieties. Much material for other varieties is in 
hand. The constants for other characters, e.g. number of ovules and number of seeds per pod and 
seed weight, are being calculated and will be presented later with more general discussions, 

a) 
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“light,” or at most into “heavy,” “medium” and “light.” The meaning of such 
crudely defined terms of course differs from experiment to experiment. They 
may, with certain limitations, enable one to say which class of seeds gives the 
best results. They do not permit comparisons of the advantages to be gained by 
seed selection in different varieties. They do not allow of the writing of a general 
formula enabling one to predict the yield from seed of given weight. Yet such a 
formula is precisely what is needed. The advantage of sorting seed—if there be 
an advantage-—depends not only upon the increased yield (or increased uniformity 
of the crop in certain cases) but also upon the cost of carrying out such a selection. 
In deciding upon the stringency which is profitable in the seed selection, the 
practical breeder should know the exact weight to be attached to each factor. 


(3) The experiments have not been carried out in a way to make possible 
the calculation of the probable errors of the results. In many cases the experi- 
ments have been small, and the conclusions are open to serious question, both on 
the ground of possible experimental errors and on the ground of the probable 
errors of random sampling. 


All of these difficulties can be overcome by the application of the modern 
statistical methods to the problems. A wide series of such biometric constants 
deduced from carefully conducted physiological investigations ought to be of great 
service to the man dealing with the practical problems of agriculture. 


II. Materiat AND MeErTHops. 


The materials for this study are drawn from three varieties of garden beans— 
the White Flageolet, White Navy and Ne Plus Ultra. Altogether there are 
twenty series grown under most diverse environmental conditions. Their history 
has already been given* for another purpose and need not be repeated here. The 
symbols used to designate the different lots are the same in both papers; hence 
the reader may make any comparisons which he sees fit. 


All of the seeds planted were, as far as could be determined by inspection, 
perfect in form and development. Each was individually weighed and classified 
in a uniformly graduated scale. These individually weighed and individually 
labelled seeds were then mixed and planted at random in rows by varieties, the 
rows in their turn being scattered over the field, more or less at random, in order 
to counteract by chance distribution any influence of the possible heterogeneity of 
the substratum upon the characters of the plants. All the determinations of 
number of pods produced were made on individual plants. 


Thus the probable errors for the statistical constants for weight of seed 


planted, number of pods per plant, and for the correlation between the weight of 


* Harris, J. Arthur: ‘A First Study of the Influence of the Starvation of the Ascendants upon the 
Characteristics of the Descendants.” Amer. Nat., 1912. In press. 
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the seed planted and number of pods per plant, can be determined. The corre- 
lations between weight of seed planted and number of pods produced are com- 
parable from variety to variety or from cultural condition to cultural condition. 


To render the results more intelligible, the straight line regression equations 
have also been calculated and represented in a series of diagrams. These show by 
the slope of the lines the (smoothed) change in mean number of pods per plant 
associated with changes in the weight of the seed planted. 


The reader unacquainted with higher statistical methods need only remember 
that the coefficient of correlation describes the degree of interdependence between 
two variables on a scale of —1 to +1. This measure of interdependence is, there- 
fore, quite independent of the magnitude and of the variability of either or both 
of the characters in question. The regression coefficient, on the other hand, shows 
the absolute amount of change in a second character y consequent upon a change 
of one unit on the scale of the first character 2 Concretely, in our present case, 
the regression coefficient shows the absolute increase (or decrease) in number of 
pods per plant associated with an increase (or decrease) of one unit in the weight 
of the seed planted. “Increase” or “decrease ” 
condition in the population as a whole. 


is measured from the average 


The correlation coefficient is fully justified as a measure of interdependence 
only when regression is linear, that is to say, when the mean value of y increases 
at a uniform rate throughout the whole range of # Where regression is not 
strictly linear, the coefficient of correlation still furnishes in many cases a very 
satisfactory measure of the intensity of relationship between two variables. This 
is true in the present case. 


All the weighings were made on seeds which had dried for several months at 
laboratory temperature. Drying at high temperatures was of course precluded by 
the fact that the seeds were to be used for planting. Drying in a vacuum over 
sulphuric acid could not be undertaken because of the excessive labour involved 
where each seed had to be followed individually throughout the whole work. 
The weight unit adopted was ‘025 gram. Hence to obtain means and standard 
deviations of weights in grams deduct 5 from values in tables aud multiply by 025. 


The correlation tables showing the relationship between the weight of seed 
planted and the number of pods produced are entirely too bulky for publication. 
It is possible, however, to present the essential data by showing the total number 
of pods produced by each grade of seed weight (Tables III—VI). A convenient 
method of calculation for such cases has been suggested elsewhere *. 


In deducing the correlations from such tables, the means and standard devia- 
tions for the two characters involved are required. The distributions of numbers 
of pods per plant for the twenty series have already been publishedt for a quite 

* Harris, J. Arthur: “The Arithmetic of the Product Moment Method of Calculating the Coefficient 


of Correlation.” Amer. Nat. Vol. xurv. pp. 693—699, 1910. 
+ Amer. Nat. 1912. In press. 
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different purpose. The tables are very large and need not be given here. The 
physical constants deduced from them appear in Table I. The distribution of the 
seed weight is shown in the frequency columns of the reduced correlation tables. 
The physical constants* for seed weight are given in Table II. 


III. ANALysis oF DATA. 


1. Number of pods per plant in Navy, White Flageolet and Ne Plus Ultra +t. 


From Tables III to VI the value of the rough product moment = (w’ p’) about 
0 as origin may be calculated straight away by multiplying up the total pods by 
the number of the weight class (in parentheses) and summing. The coefficient of 
correlation, 7, is then deduced from the formula 
> (w'p'/N — Op 


Tup = ue 
won 


while the equation for the regression straight line is given by 


p= (p Twp wo) Tp 2 Ww 
Sat TU hs ae — f wp 
ow ow 


where p and w represent weight of seeds and pods per plant, the bars denote the 
population means of the respective characters, and the sigmas their standard 
deviations. The variable p is integral and there is no need for grouping; the 
unit of w is ‘025 gram, with class 1 ranging from 0 to ‘025, and centered at 
0125 gram. 


The correlation coefficients and regression equations are given in Table VII. 


The straight line regression equations being available for all series, it is only 
necessary to determine the empirical means for number of pods per plant to test 
by graphical methods the linearity of the regression of pods per plant on weight 
of seed planted. ; 


It is not feasible—because of the rather great labour demanded, and the com- 
plexity of the diagrams—to do this for all the series. In the diagrams (Figures 
1—8) the slope of all the twenty lines given by the equations is shown. In the 
case of certain of these lines, selected quite at random, the values of the empirical 
means are indicated in the usual way. 


The empirical means are scattered with some irregularity and from inspection 
alone one might suspect that regression is not strictly linear}. In other words, 


* Sheppard’s correction was applied in calculating the moments for seed weight, but not in obtaining 
those for number of pods per plant, since the latter varies discretely. 

+ Other series of material, and the relationships for seed weight and characters other than number 
of pods are being reduced. 

+ The sensitiveness of the number of pods per plant to environmental conditions (and consequently 
the great variability in the means, especially near the ends of the range of seed weight) is so great that 
it has seemed inadvisable to attempt more refined mathematical treatment of the problem of linearity 
of regression. I hope to do this later on larger series of material. 
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the mean number of pods may not increase at the same rate from the lowest to 
the highest, the lightest to the heaviest, grade of seeds. Practically speaking they 
may be considered to do so. 


The coefficients of correlation are rather small, ranging from about ‘120 to 
280. The average of the twenty series is ‘1615. It may be noted, however, that 
they are in every instance positive, thus indicating that the selection of larger 
seeds will give a somewhat higher yield of pods. 


The second term of the regression equation enables us to read off at once the 
increase in the number of pods per plant to be secured by selecting seeds one unit 
(i.e. 25 mg.) or more above the average. These values range widely, as is to be 
expected from the fact that the crops upon which they were calculated were pur- 
posely grown under most diverse cultural conditions. The slopes of the regression 
lines will make clear to the eye the advantages to be secured by planting heavier 
seed. 


Co~p Sprina Harpor, U.S.A, 
January 29, 1912. 


TABLE I. 


Pods per Plant. 


ee Total Men cana Standard Deviation | Coefficient of Variation 
meee Plants Probable Error and ane 
Probable Error Probable Error 
NHH ASE 1484 16°99 +°15 8-67+£°11 51°02+0°78 
NHHE ... 1271 11°93 +°10 5°17 +07 43°29 +0°68 
NHD eas 1416 3°97 + °03 1:94+ 02 48°97 40°76 
NEL Dee 1204 4°58 + ‘05 2°38 + °03 51°84+40°88 
NDD Sin 513 3°59 + °06 1°89 + ‘04 52°66+1°38 
NDDD ... 459 4°41 + -06 1:93 + 04 43°86+1°15 
NDH ae 670 14°62 + °21 8°244+°15 56°39+1°33 
NDHH Ses 565 11°83 4:14 4°96 +°10 41°94+0°98 
FSS nee 868 15°03 4°17 741+ 12 49°34+0°97 
FSC eat 586 14°22 + ‘21 7°38+4°15 51°90+ 1°27 
FSH ite 475 17°29 + °25 7:94+°17 45°89 +1°20 
HSH Se 429 11°84+°16 4°80+°'11 40°50 +1°08 
FSD oe 428 3°43 + ‘06 1°69 + 04 49°42+1:°39 
FSDD we 387 4:04 + ‘06 1°73 +'04 42°85 +1:°22 
USS ae 680 15°74+°16 6°04+'11 38°37 40°80 
USC wee 530 10°14+°13 4°38 + °09 43°23 +1°05 
USH e, 361 14°04+ °20 5°55 +°14 39°56+41°14 
(OME EE eee 224 8°444°15 3°24+°10 38°45 + 1°25 
USD nets 312 2°59 + °05 1:23 + :038 47°3041°54 
USDD _... 237 3°62 + ‘09 2°10+°07 57°97 42°32 
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TABLE II. 
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T Standard Deviation | Coefficient of Variation 
ae otal Mean and 
perica Seeds Probable Error and and 
Probable Error Probable Error 
NH 1484 9°44+°03 1°51 +:02 15°96 + °20 
NHHIT 1271 9°27+°03 1°33 + ‘02 14°36 + °20 
NHD 1416 9°46 + 03 1°51 + 02 15°93 + °21 
NHDD 1204 8°46 4°08 1°41 +02 16°66 + °24 
NDD 513 1 teaxt 04 T3203 Ode 37 
NDDD 459 8°76 + °04 1°30 + °038 14°89 + °34 
NDH 670 7°62+°03 1°28 + ‘02 16°83 + °32 
NDHH 565 8°93 + °04 1°24+°08 13°93 + °28 
FSS 868 8:20 + °03 1°43 + -02 17°48 + 29 
FSC 586 8°31 +404 1°40 + 03 16°8l+°34 
FSH 475 8°33 4°04 1°44+ ‘03 17°28 + '°39 
FSHH 429 8°48 + 04 1:07 + 02 12°67 + 30 
FSD 428 8°22 +°05 1:43-2:03 17°38 + °41 
FSDD 387 C19 °04 1:04 + :03 14°52 + °36 
USS 680 14°14+ :07 2°71 + °05 19°18 + °36 
USC 530 14°50 + ‘08 2°58 + 05 17°82 + °38 
USH 361 14°33 + °09 2°65 + :07 18°50 + 48 
USHH 224 13°94+ °09 1:93 4:06 NB tepyse 4!) 
USD 312 14°39 +°10 2°63 + °07 18°28 + °51 
USDD 237 10°85 + 08 1°78+:06 16°37 4°52 
TABLE III. 
Series NHH Series NHH H Series NHD Series NHDD 
Weight of 
Seed Planted a =a = 
F Total Pods F Total Pods F Total Pods F Total Pods 
050—‘075 ( 3) — — — — 1 2 2 4 
075—*100 ( 4) 1 5 = = = = 3 9 
*100—"125 (5) 3 36 — —- 3 10 14 54 
125—'150 ( 6) 23} 306 10 89 18 57 80 350 
150—'175 ( 7) 96 1439 95 1015 89 294 230 1017 
175— ‘200 ( 8) 281 4102 265 2932 276 1019 319 1382 
200—'225 ( 9) 401 6614 376 4583 386 1559 285 1317 
225-—:250 (10) 330 6200 308 3701 298 1235 157 782 
*250—'275 (11) 216 4040 151 1959 Aah 860 80 401 
*275— °300 (12) 94 1734 51 668 95 413 25 153 
*300— ‘325 (13) 0) 545 8 106 28 117 6 29 
*3825—°350 (1h) 9 119 4 63 8 40 3 19 
*350—°3875 (15) 5 76 2 39 3 1133 — = 
*3875—:400 (16) — — 1 9 = = — — 
Totals 1484 25216 1271 15164. 1416 5619 1204 Dols 
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TABLE IV. 


Series NDD Series NDDD Series NDH Series NDHH 
Weight of 
Seed Planted 
F | Total Pods F Total Pods F Total Pods if Total Pods 
*050— 075 ( 8) | — — = — 1 1 2 
‘O75—100 ( 4) 3 6 2 4 8 122 = = 
"100—°125 ( 5) 25 65 4 17 26 326 1 17 
*125—°150 ( 6) 53 152 15 47 76 814 1B} 116 
USO 1754 Wa) 43 481 44 195 198 2653 42 427 
“175—"200 8) 151 564 124 524 209 3103 160 1847 
-200—-225(9)| 87 | 341 147 616 104 1890 164 1963 
ee ee (10) 42 175 87 420 37 641 130 1613 
*250— 275 (11) 8 55 Dif 140 9 188 42 546 
25 +300 (12) 1 4 6 42 2 49 9 123 
3800—-'325 (13) —_ — 3 18 — — 3) 28 
325— 350 (14) | — = = ae = = is ee 
300 o lon (Loy = — = — — — — — 
‘375—'400 (16) — = = = = 
Totals bile: 1843 459 2023 670 9794 565 6682 
TABLE V. 
Series FSS Series FSC Series FSH | Series FSHH| Series FSD | Series FSDD 
Weight of 
ge Total’ || oe total Total Total Total Total 
ota ota ota ota ota ota 
F Pods F Pods F Pods F Pods F Pods F Pods 
050==076:(3) | —— = 1 20 2 4 1 3 
O075—:100 ( 4) 6 96 1 5 2 13 1 15 3 9 2 4 
*100— 125 ( 5) 21 275 15 173 9 139 3 26 10 30 10 40 
125—150 { 16) fe) 1002 40 463 43 652 13 155 25 63 79 286 
150— ‘175 ( 7) | 170 2432 98 | 1238 80 | 13869 43 445 78 252 166 676 
175— 200 ( 8) | 241 3566 | 168 | 2355 | 116 | 2049 | 166 | 2023 | 138 498 84 356 
200—°225 ( 9) ' 194 3065 | 154 | 2497 | 131 | 2265 | 131 | 1493 | 102 364 36 155 
*225— ‘250 (10) | 118 1827 72 | 1164 64 | 1207 60 760 43 152 8 41 
ae 275 (11) 36 627 ol 420 Py alll 10 133 20 57 1 1 
75— ‘800 (12) 10 153 6 69 5 90 2) 30 Hi 37 — — 
ie 825.13) =7| = 1 i — 
| 
Totals 868 13043 | 586 | 8334 | 475 8215 | 429 | 5080 | 428 | 1466 387 | 1562 
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TABLE VI. 
Series USS Series USC Series USH | Series USHH | Series USD | Series USDD 
Weight of - i 
eoaumeuauted RP Total RP Total P Total RP Total RP Total RF Total 
Pods Pods Pods Pods Pods Pods 
‘O75—'100 ( 4) | — = 1 15 — = = <= = 
*100—"125 ( 5) | — —_— 1 3 = = -- — —_— 
"125— 150 ( 6) | — — 1 3 2; 7 
*150—'175 ( 7) | — — — —_ 1 16 5 7 
"175—'200 ( 8) | — -— 2 7 1 13 | 113} oD 
*200—‘225 ( 9) 8 98 4 27 1 16 = = 5 11 ill 91 
*225—'250 (10) 18 | 236 5 42 10 128 5 28 33 7 49 161 
*250—:275 (11) 57 943 O7, 952, Pa 336 20 176 16 3 61 216 
*275—"300 (12) | 118 1745 62 543 43 536 26 | 180 48 114 34 155 
*300-— 325 (13) | 124 1924 92 916 73 | 1020 43 346 55 135 22, 93 
*325—-850 (14) | 117 1764 | 109 | 1120 56 828 49 434 60 163 14 55 
"3850— ‘375 (15) 79 1261 70 805 61 845 36 Sh} 53 142 5 | 45) 
‘375—'400 (16) 43 731 60 616 28 445 26) 235 19 47 ik &| 5 
*400—*425 (17) 34 535 28 284 115) 192 8| 76 14 39 = — 
*425—'450 (18) 19 295 21 236 12 203 6| 47 9} 28 = = 
450— ‘475 (19) 23) 402 19 216 11 141 4 31 1 31 = = 
*475—*500 (20) 16 323 16 151 8 118%} 1 | 14 8 28 22 _ 
‘500— 525 (21) 13 246 8 92 ll 174 — | = 3 8 — == 
"525 —°550 (22) 6 94 2 25 1 10 = —- 4 10 = = 
550—575 (28) 3 57 2 25 1 15 = — 1 7 = — 
‘5T7T5—:600 (24) 2 48 |) — — 1 18 = — 2, 7 — == 
Totals 680 | 10702 | 530 | 5376 | 361 | 5069 | 224 | 1890 312 809 237 858 
TABLE VII. 
Number of Pods and Weight of Seed in Working Scale of Weights. 
Coefficient of : F 
2 3 Regression Straight 
Series Correlation and : : 
Prokable Biror Line Equation 
NHH ‘177+ °017 p= 738341017 wv 
NHHH "145+ :019 p= 6°719+0°562 wv 
NHD 129+°018 p= 2396 +0°166 w 
NHDD 2 te Ol9 p= 2°862+0°203 w 
NDD "282 + ‘027 p= ‘469+0°404 wv 
NDDD 215+ °030 p= 1°618+0°319 w 
NDH "258 + -024 p= 1°9824+1°657 w 
NDHH 152+ 028 p= 6:412+0°606 uv 
FSS 098 + °023 p= 10°856 +0°508 w 
FSC 147 + 027 p= T7187+0'774w 
FSH "100 + ‘031 p=12°692+0°553 w 
FSHH 121s -032 p= 7°256+40°541 w 
FSD “130 032 p= 2°164+40°153 w 
FSDD 144+ 034 p= 2°32840°238 w 
USS "155 + 025 p=10'876+0°'344 w 
USC "150 + 029 p= 6453 +0°255 w 
USH "129 + :035 p=10'165+0°271 w 
OSH H 143 + 044 p= 507640241 w 
USD °195 + '037 p= 1°287+0-091 w 
USDD 241 + ‘041 p= 5385+0°284 


ON THE PROBABLE ERROR OF A COEFFICIENT, OF 
CORRELATION AS FOUND FROM A FOURFOLD 
TABLE. 


By KARL PEARSON, F.R.S. 


Let the fourfold table be 


a b a+b 


| 

| 

| 

ae, | 

a+e | b+d N | 


Then on the assumption that the frequency distribution is normal, we can by aid 
of Everitt’s Tables of the Tetrachoric Functions* rapidly find vr. I have shown in 
a paper published in the Phil. Trans. in 1900+ that found in this way 


Probable error of r 


67449 ((a+d)(e+b) ,(a+c)(d+b) ,(a+b)(d +c) 
— — — at + ne oe nate PEL: + + 
VN, 4.N? e N? ¥ Nv? 
ad — be ab —cd ac—bd)? 
+ Baap, ee — ap, OE — a, © wt ay 
1 “By 122 By 152 
where Ve = ce j, 6 t4az: hs = te end 
h—rk k—rh 
ie eer ee 
1 1 =5 3 (H+ Be 2rhk) 
Ses aes —— @ ) 
x In V1 —2 


* Biometrika, Vol. vit, p. 436, and Vol. vi, p. 385. 


t Phil. Trans. Vol. 195 A, p. 14. Owing to the carelessness of the printers my x was put as Vx 
and the last N? in the denominator as N,. 


. — 
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and h and & have their usual meaning defined by the integrals 


(atc)-(b+d)_ 1 [h_ 


1,52 
— ——| e **dz=4a,, say; 


2N V2! 0 
atk k ° 
(a +b) 7 (¢ +d) = —| e-2dz = has, say. 
2N V2ar J 0 
17,2 1 17.2 
Let H =——e-2?”, K = ——e-?” ag usual. 


= Qa f \ Qar 


Now the formula (i) above for the probable error of 7 is admittedly laborious 
in use. I have tried in many ways, while retaining its full accuracy, to throw it 
into a form involving less laborious calculations; I have not succeeded, however, 
in achieving any sensible reduction in its complexity, as long as I maintain its 
complete generality. 


Although many hundred fourfold tables have now been published, many of which 
give such small correlations that their true significance can only be settled by 
a knowledge of their probable errors, yet I find only 40 to 50 probable errors have 
so far been determined. This matter seems so regrettable that I have sought for 
a fairly easy method of determining a closely empirical expression for the probable 
error of 7 which is likely to be of service, and can be adapted easily to tables. 


I consider first two extreme cases. If h and & are both zero, or the fourfold 
division at the mean, then y,=y,=0*, 
Probable error of r 
67449 2 V1 — 1? ((a+d)(b+e)\} _ 67449 V1 — 7? @ (16ab 
a VN fe) SNe 7 VN ie } 


since in this case a=d, and b=c. 


But for a division at the mean by Sheppard’s Theorem 


b . (3 s 
Tr = cos 7 ——- = sin | —- —-—__ }, 


a+b 2 a+b 
or (sin r)/}a = (a — b)/(a+ b). 
‘sin 7 4ab __ 16ab 
Hence 1- (= Gap ae 
Substituting we have: 
Probable error of r= — 5 Vl—9r a = (a) are (11), 


if the angle of the inverse sine be read in degrees. 


Again if r=0, the probable error of r may be obtained from (1) whatever the 
values of h and k. For in this case 


ad — be =0, w= $4, Wo = 52, O= TG 
* Phil. Trans. Vol. 192 A, p. 141 and Vol..195 A, p. 7. 
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We have (b+d)//N=i(1-a), (a+c)/N=4(14+4), 
(a+b/N=}0 +4), (C+0/N=$(1—4), 
a+d _(a+b)(a+e) abe (or a) Ord) admate 


N NW N? N N 
=f(1 + a,) (1 ete Cm) a5 el —a,) (1 = a%)=4(1 + a,0,), 


since ad — be = 0 in the original population. 


Similarly : aa =3(1 —aa). 
ab—cd _a(N—a—c-d)-cd 
N?2 aa N? =e 
_& G+ oa+d) 
— N N? 


(1 +,)(1 +a) —4 (l+)( + 0%) 


and similarly : 


Hence substituting in (1) 


Probable error of 7 = ett tats (1 — 0,202") + zhao? (1 — a7) + apm? (1 — 0") 
— 40, (1 —a/)— 40,7 a”) }3 
67449 iz a 
= Fy EER Ys L — %) (1 — 0%) a'siaisioldie/e ae; eigalaiehe eels See (111). 


This can also be put in the form: 


67449 (a+b)(a+c)(d+b)(d+c) Ge) 
— WHR a ee eee : 


This is the probable error of 7 of a fourfold table when the real value of r is zero. 


Probable error of 7 


Now as (ii) and (iv) give the reducing factors for the two cases (a) when h and 
k are both zero but 7 has any value and (b) when h and k have any values but 7 is 
zero, it occurred to me that the combined product of the two would give good 
results for a considerable range of values of h andk andr, We have to note that 
(iv) for h and k zero becomes 
67449 a 
VN 2 
Hence we take as our formula: 


Probable error of 7 
SS PY my 1 ees ay vE(1+ a )4(1 —a,) VE(1+ a) 4 (1 — &) 
H K 


wan Vs 
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On 


Now it will be seen that this consists of three parts: 


(aq) VJ —7 Ne — (a0): This is easy to table for all values of 7. 


0) Ee. 


(c) vt Pots a CLS 5) 
K 
Both these (b) and (c) can be readily found from a single table rapidly formed from 
Sheppard’s Table of the Probability Integral. The entry to the single table will 
be (a+c)/N or (a+ b)/N, i.e. §(1 + @). 

Thus a knowledge of the correlation 7 and the two division percentages (together 
with Miss Gibson’s Table for °67449/V NV), will enable us by the aid of the two 
new tables to rapidly write down four factors whose product gives the required 
probable error. I have tested the form (v) against the true probable error as found 
from (i). In all cases it gave results differing only from the true value at most by 
about one or two units in the third place of figures—a result amply accurate for 


all practical purposes. 
Illustration I. 


211°25 153°75 365 
152;75 |) 560'25 713 
364 714 1078 


The correlation was found to be °5557 + ‘0261; the probable error from the short 


formula was ‘0265. 
Illustration IT. 


1562 42 1604 
383 94 477 
1945 | 136 | 2081 
The correlation was found to be 5954 + °0272; the probable error from the short 
formula was ‘0293. 


Illustration ITT, 


455 622 | 1077 
599 | 1324 | 1923 
1054 | 1946 | 3000 


The correlation was found to be "1811 + ‘0210; the probable error from the short 


formula was ‘0199. 
Illustration IV. 


849 665 | 1514 
205 | 1281 | 1486 
1054 1946 | 3000 
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The correlation was found to be 66383 + ‘0132; the probable error from the short 


formula was ‘0132. 
Illustration V. 


1196 | 223 | 1419 
318-1263 | 1581 


1514 | 1486 | 3000 | 
The correlation was found to be ‘8464 + 0079; the probable error from the short 
formula was ‘0079. 

These examples will suffice, I think, to give confidence in the formula and in 
the tables accompanying this paper. The absence of probable errors from the 
expressions for fourfold table correlations can no longer be justified on the ground 
of their great laboriousness. 

The following Tables have been calculated by Miss Julia Bell. 


Let y,='67449/VN. This is given by Miss Gibson’s Tables, Biometrika, 


Vol. 11. p. 387. Let rs. 
—— sin r\? 
x= Via 4/1 -( ac 


and xe= VEC +a) x Fa) 
Then Probable error of 7: =.y, ayneNa aXe 


TABLE I. Values of yx, for Values of r. 


r Xr r Xr a Xr te Xr Ue Xr 
00 1-0000 20 OT “40 "8845 ‘60 "7298 “50 4843 
“O01 9999 i ‘9688 thd Si8o “61 -7200 “81 4687 
‘02 9997 22 9657 “42 8723 62 "7099 82 "4526 
03 9994 “23 9625 43 ‘8659 63 6997 "83 "4362 
‘OL “9989 “24 9591 “Ad "8594 G4 6892 “8h “4192 
‘05 9982 25 9556 45 "8527 65 6785 85 4018 
‘06 “9975 26 9520 “YC "8458 “66 6675 86 3838 
OF 9966 Ni 9482 “47 "8388 67 6563 “87 "3652 
08 “9955 ‘28 9442 48 "8315 68 6448 “88 3461 
09 9943 -29 ‘9401 “19 "8241 69 6331 ‘89 "3262 
‘10 ‘9930 30 “9358 50 "8165 ‘70 6211 “90 ‘3057 
| ye 9915 3 9314 51 “8087 S77 6088 “91 2843 
| +12 ‘9899 Poe, 9268 52 "8007 72 5962 92 2620 
13 ‘9881 ‘83 | -9221 53 7926 | -73 5834 93 2387 
LY 9862 Bh 9172 DJ 7842 Th 5702 ‘94 2142 
15 9841 85 9122 ‘55 ‘7756 75 5568 "95 ‘1882 
‘16 9819 “36 9070 56 7669 76 5430 “96 "1605 
a7 ‘9796 BY 9016 ‘SY "7579 “77 “5288 “97 "1305 
“18 ‘9771 88 “8961 ‘58 7488 78 5144 ‘98 0972 
“19 9745 “39 "8904. ‘59 7394 79 “4.995 “99 "0585 
1:00 0000 
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TABLE It. 
Values of xa for Values of $(1+ a). 


3 (1+a) Ra 4 (1+a) Ge 3 (1 +a) G3 4 (1+ a) Xa 
50 65 1°2877 ‘SO 1°4288 ‘95 2°1132 
wil 66 1:2928 “S1 14457 ‘96 2°2740 
Doe || 67 1°2984 “82 1°4641 97 25071 
‘58 68 13044 83 1°4844 ‘98 2°8915 
‘S4 69 1°3109 “Sh 1°5067 ‘985 3°2097 
20) 1°2569 70 1°3180 "85 NES Silty “990 DulooD 
56 1°2585 71 1°3256 ‘86 175590 ‘991 3°8854 
“57 1:2604 72 13338 87 1°5897 “992 40639 
58 1°2626 73 1°3427 ‘SS 1°6245 ‘993 4°2784 
59 1:2652 V4 1°3523 89 1-6640 994 4°5419 
‘60 1°2680 5 1°3626 ‘90 1°7094 ‘995 4°8779 

Of) eae 6 1°3738 91 1°7623 ‘996 5°3278 

2 1:2748 seme ic 3 859 “92 1°8249 ‘997 59776 
63 1°2787 ‘78 | 1°3990 ‘93 1°9003 “998 70465 
‘64 1:2830 ‘79 1°4123 ‘4 1:9937 ‘999 93870 


MULTIPLE CASES OF DISEASE IN THE 
SAME HOUSE. 


APPENDIX TO PAPERS IN BIOMETRIKA,-Vou. vin. 
-p. 404 AND p. 430. 


By KARL PEARSON, F.R.S. 


I REGRET that a most careless algebraic slip has crept into my work on this 
subject. It stared me in the face when I saw the published number of Biometrika, 


and I cannot understand how it escaped me in MS. or proof. 


After obtaining the fundamental equations 


StD.D yh 

Op Ti hn Se 5 PEE POR A Seton 33309 Sco (xvi), 

Oy, = Ds ( — | PEPE Rn Ar SS G8000 (xvi1), 

where N=n (1 + =) ne oo dnt soot (xvill), 


on p. 410, I continued on p. 411: 
“Now let us write #,=s*p;; then clearly o*, =s?o*, ....” Clearly nothing is 
would then equal s*o*, and aes so, ,and thus the value of y?, 1.e. 


Ps 
sie es (ps — Bs) 
; Ds 


more false! 0°, 


IP 


is erroneous. 

It is not possible I think to obtain the value of y? in this indirect and brief 
method. We must return to the multiple correlation formulae of my memoir in 
the Phil. Mag., July, 1900, p. 161, and evaluate the determinant R and its minors 


for the special values (xvi) and (xvii). 
Following the lines of that paper let us write : 
PS3/N = sin? By ssn.ccss:. ce eee (xix); 
fe Nic oSs Sages 
= , 


then o> = 


and op = WN COS By Sin Bs/8 ..2000s7.0n cee eee (xx). 
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Hence from (xvi): 


N cos 8; sin 8; cos 8; sin Bs st N? sin? 8, sin® GB; 
——— — —R,» = —— ; 
t s PtPs SEN 
or Rap, See LAIN Ope tae setae aleneg ace lex outa: (x1), 


and agrees entirely with the value of 7, used in my Phil. Mag. paper just 
referred to. 


In other words the determinant FR and its minors &,; and R,, will take precisely 
the same forms as in that paper provided we 


(i) replace Kq. (x1) of that paper by our result from (xix) above, Le. 


n; = cot? 8; -55 ee ees Santee Sees vote nea ge as (xxi), 
and 
(11) remember that we must reduce the total number of our variants 
Por Pry +++ Pu 
by the two relations S(ps)=™, S(Sfig) 4 es et oe teeth neg oe (xxill), 


where m is the total number of houses and n of cases. That is to say our total 
number of frequency groups, which are not wholly dependent but are correlated, is 


utl—2Z=—u—1. 


We select as our dependent variates, which are fixed as soon as the others are 
known, p, and p,. Thus the value of the auxiliary determinant J, p. 162 of my 
Phil. Mag. memoir, is: 


J=|—-m 1 Wee recs 1 
1 -m 1 1 
1 i == 1/3 1 


1 it she 4 
If A= (1 +m) (1 + 92) (L +)». 1 + u-1): 


we find as in that memoir: 


1 1 ] , 
J (— 1) Ne ee | OL enerenerelveness (xxiv), 
Xr 
5 ND ae nnn 6 cXV 
I 9) Fm) oy 
Jn=(- 1p {,_ i I = 


+1 | l+m 1l+m 
1 = 1 - een 
1 + 4s—4 I + 544 ie 1 as 
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To evaluate these we note that 
Let il uw = : "(pss’) 
s aa = 'S (sin? 8.) = ‘s Pee, 


but S (ps?) = NV as shewn on p. 411 of my Biometrika memoir. Hence: 
1 


u—1 il 1 rey 
8 (ry) ye Poe) 
Thus we have: J =(— 1s ; 
BS ia 10,87 Pu 
Te= gy, (tena 
_—_ n—1 
J st = ( a be 


(1+ ms) +m)" 
Next as in the Phil. Mag. paper (p. 161): 
R=(-1yJ/r, 
Ra = (lye sco ean, 
Rye =(— 1)" ST s¢ cot Bs cot B;/r. 
Thus: R= pyw]N, 
ee = cos? By ( ps8? + Py w?)/N, 
Ry = cos 8; sin 8, cos B; sin B;. 


Finally we have: 


; Res = cos" Bs (ss? aF Pu uw De 1 
Koy 7 Pu * cos? By Bs 
l/s? 3? 1 
-3(F+2 =| = = 2 ar S008 4 
U\Du Ps/> Pu” Ds 
aaa Ry _ _ cos (ex: sin Bs cos By sin B; st 
Ro, oy, ~ Puv|N * ‘cos Bs sin B, cos 8; sin Py 
sey 
we pu 
Thus finally y? is given by 
u-1 R.. ; Rs 
i 25 7, ae DS i. (CD 
S Ro, (Ps Ps) + 29 Ray Gy, (Ps Ps) (prt Pr) 
u—1 ( 2 en ¥ st 
= = \i ie eS ae ee 
e ed a) (Ps Ps) eee on MPs— Po) De poh 
u— gy (Psa Dey i {“s" < f 
—— +—— {S$ s(p,—Ds)} . 
: Ps wpPu (1 (P ) 
But iS S (ps — Ps) = 0, 
1 


Vil 


therefore Ss (ps = ps) =—U (pu — Du): 
1 
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Thus we have: v= ar Pel 


In other words x? is to be found from the ordinary y? for the w+1 variates 
excluding the houses with zero cases. 1 had excluded these houses from my 
weighting formula, Le. 

eng (eB 
i Bs 
which was, however, in error. The higher multiple cases are now we see not 
heavily weighted, but we are not to use the frequency of the zero case houses in 
evaluating our y’, This is the effect of the double relation between the u+ 1 p’s. 
An additional p is cast out of our result as compared with the ordinary frequency 
problem of goodness of fit. 


I have next to consider how far this correction modifies the results obtained 
in my papers for numerical cases. 


Q . a 5) it a9 Jp , 
. ~_ 
For enteric cases we have as on p. 412 of my Biometrika paper 


, _ (3398 — 3350)? | (78-56 | (2—1) 
Mere wSN0ee er eget! ora 
= 678 + 8°643 + 1 = 10°321, 


hence we have for n’=3, P=:006, or the odds are about 166 to 1 against such 
a large divergence from chance, 


For cancer cases we have from the returns on p. 432 
»_ (26 (94 | (416)? C914) 
x 3124" 20 * 184 " -086 
= '022 + 4418 + 9:405 + 9°714 = 23°559. 


For n’=4, we have P= 0003 or the cancer house distribution is a very 
improbable one. 


If we deal with the experimental data that were obtained for probabilities on 
the same basis as the cancer statistics we have: 


a le ep il 
x =313 T9979 


= 029 + 310 + 500 = ‘839. 
Or, for n’=3, the probability is over °60. 


Thus although I made a bad algebraic slip the new values confirm practically 
the old and Dr Webb’s cancer data suggest that there may very possibly exist 
a relationship between cancer and environment of some kind. 


Since my papers in Biometrika were written, my attention has been drawn to 
a Special Report on Cancer in Ireland which was issued in 1903 as a supplement 
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to the 38th Annual Report of the Registrar-General for Ireland. On p. 34 we 
find that in the ten years 1876-1885 inclusive there were 12 multiple cancer 
houses in the City of Dublin. In all these instances in only one house did the two 
deaths occur in members of the same family. There are no cases—or at least no 
cases recorded—of more than two deaths in the same house. 


Unfortunately the paper does not give either (i) the number of inhabited 
houses in the City of Dublin during the ten years under consideration, or (ii) the 
total number of deaths in the City of Dublin from cancer. What useful purpose 
the then Registrar-General could conceive such data would serve completely puzzles 
me*. He draws up a summary of his Tables, introducing it with the words: 
“T venture to draw attention to some of the main facts which they disclose,’ and 
clause (7) runs: 


“That in some instances more than one case of cancer has occurred amongst different 
families living in the same house, or amongst successive occupants of the same house.” 


Now unless this is meant to be interpreted as a suggestion that the multiple 
houses are in excess, it must be anticipated from the mere random occurrence of 
cancer, Anyhow without further information the data on this point, as on many 
others in this Special Report on Cancer in Ireland, are wholly worthless and the 
publication does no credit to a Government Department. 


I have striven to obtain the requisite additional data from the present 
Registrar-General for Ireland. He most kindly informs me that the inhabited 
houses of the City of Dublin numbered 23,896 in 1871 and 24,211 in 1881; and 
for the Registration Area of Dublin, 34,118 in 1871 and 36,232 in 1881. All 
these data are from the Censuses of those years. The deaths from cancer in the 
Registration Area for 1876 to 1885 inclusive were 1714, but how many of these 
occurred in the City of Dublin he is not able to tell me. I presume the late 
Registrar-General must have known these deaths in order to detect multiple 
houses, but apparently they cannot now be ascertained. It is clear therefore that 
the data of the “Special Report on Cancer” must remain practically worthless, 


If we suppose the City of Dublin to have had a number of cancer cases 
proportionate to its houses we might take: 


4 (23,896 + 24,211 
Cancer Cases in City = Le aba 1714 = 1172, 


roughly 2 of the cases in the Registration Area. 


Since the middle of our period is not very far from the Census year 1881, we 


might take 
R= Ai2: m = 24,211. 


* Cancer in the Report is associated with alcohol, syphilis, smoking, etc., in an equally unscientific 
manner. Unless we know the incidence of each character or disease in the population, how is it possible 
to determine whether the association is merely due to chance or not ? 
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Whence we deduce: 


Calculated Observed 
Dp, = 11166 Dp, = 1148 
B= 270 p= 12 
P= 0-4 P= 0 


x= 87 and P= ‘013, 1. odds of about 76 to 1. 


Here our deviations are not as before towards multiple houses, but cancer 
apparently avoids a house where it has paid one wsit! It is difficult to believe 
that the frequency should be so far below a random distribution in the case of 
multiple houses, and the matter would be still worse, if there were more cases 
relative to the houses in the City than in the Registration Area. I think we may 
safely conclude that the data for multiple houses provided by the Registrar-General 
for Ireland are probably more than 100 per cent. in error, and theretore are wholly 
worthless for the purposes for which they are apparently stated. We cannot 
suppose one visit of cancer to confer immunity on a house, and the fact that the 
multiple houses are so significantly short of the chance frequency suffices, I think, 
to discredit the data and the accuracy of the methods adopted in this Special 
Report on Cancer. 
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A STUDY OF THE VARIATIONS IN THE FEMALE PELVIS, 
BASED ON OBSERVATIONS MADE ON 217 SPECIMENS 
OF THE AMERICAN INDIAN SQUAW*. 


By ARTHUR BREWSTER EMMONS, A.B., M.D. (Harvard), Boston, 
Mass., U.S.A. 


THE object of this investigation was to determine, so far as possible, the 
variation in form of the “normal” human female pelvis. By “normal” it was 
intended to exclude all pathological pelves, and to include all variations, not the 
results of disease. Our conception of “normal,” for a standard of comparison, 
should, I believe, include not only the average measurements, but also the 
minimum and maximum as well as the proportion of cases at regular intervals 
between these extremes. 


The bones of the American Indian of the earlier times, collected from various 
parts of North and South America and the adjacent islands, are said by the 
authorities on the Indian (1) to be entirely free from rhachitis, and that other 
diseases affecting the bones, as tuberculosis, osteomalacia, and syphilis, are rare. 
This statement was borne out in my series of specimens, for no evidence of these 
diseases was encountered. In an occasionally elderly specimen, however, the 
remains of an old osteo-arthritic process were found. ‘This late change could 
have had little or no effect on the form of the pelves. 


Varying conditions, exclusive of disease, such as differences of nutrition, and 
certain habits as sitting up for long periods in early infancy, carrying heavy 
burdens in youth, are the common causes and most probable influences modifying 
the shape of the pelvis. Among the many Indian tribes from which the specimens 
of this series came, these factors may have been present at times in varying 
degrees, yet it seems fair to consider them all as from a pure unmixed race, and 
thus the series should yield a true type. 


A due proportion of variations can scarcely be obtained with certainty unless 
at least 200 specimens are used. A still larger series than mine might yield 


* Awarded the Boylston Medical Prize Essay, 1912. ‘‘The Boylston Medical Committee do not 
consider themselves as approving the doctrines contained in any of the dissertations to which premiums 
may be adjudged.” 
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slight or occasional variations from the standard here set. But it is felt that 
this series of 217 specimens will form a group sufficiently comprehensive to 
include all important variations in approximately their due proportions, and serve 
for a standard of comparison with the male pelvis and the pelves of animals and 
other races of man, as well as a standard to differentiate the pathological. 


Material. The material on which this paper is based is to be found largely 
in the splendid collection of the U.S. National Museum in Washington. The 
remainder is in the Peabody Museum, Cambridge, Mass., and the American 
Natural History Museum, New York City. To the authorities of these insti- 
tutions the writer wishes to express his thanks for the privilege of using the 
specimens, which has made this study possible. He also desires to extend his 
grateful acknowledgments to Dr Hrdlicka of the U.S. National Museum for his 
kindness and advice. 


To obtain accuracy and to avoid error several special means were employed. 
A few of the specimens were somewhat injured. The exact relative position of 
these bones could not always be accurately fixed. Ina few cases careful estimates 
only could be made, but no specimens were used unless the relative positions 
of the bones could be obtained with a fair degree of accuracy. When broken 
specimens were used the measurement was marked (“a”) approximate. The use 
of the accurate “compas glissiere” for the greater part of the measurements 
also tended to reduce error. Nearly all measurements were made on disarticulated 
pelves. To hold the bones in proper relative positions the hand and the sand- 
box were found rather unsatisfactory. The apparatus seen in Plate V_ was 
therefore devised and used for the entire series. It is believed that the margin 
of unavoidable error in these measurements is not a great one. 


Measurements. Anthropologists differ widely in what they have considered the 
essential measurements of the pelvis. These differences probably arose because 
the pelvis is such a complicated architectural bony structure. 


Topinard (2) gave measurements on 207 pelves of animals and man. 
Verneau (3) made a large number of measurements on specimens from many 
different races. Turner (4), reducing the number of measurements made by 
Verneau, gave observations on specimens of many peoples collected on the 
Challenger expedition. From these I have selected only those few measure- 
ments which seemed to me to be most essential for comparison, and I have added 
a very few observations of interest in regard to one particular region, the pelvic 
outlet. 


In the last step in the evolution of man either from the primate to man or 
from one class of primate to another, whichever anthropological classification of 
man is used, the most essential change which concerned the pelvis was the 
assumption of the erect posture. The pelvis has carried on the function of 
child-bearing probably ever since a pelvis has existed. The function of weight- 
bearing, however, by the recent assumption of the upright attitude was transferred 
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from four legs to the hind two. The pelvis, now become a very important girder 
in the bridge structure, thus came to bear the weight not only of the fore part 
of the body but also of the head, and this head has since increased considerably 
in weight. With this large increase in the weight to be carried by the pelvis 
has come a marked change in the direction in which that weight is applied. 
These three factors, the change in direction, the extra weight, and the shortness 
in time since this change has taken place, would lead us to expect changes of 
form of considerable importance and with them variations, the frequent accom- 
paniment of recent change, in those parts to which this function applies. 


The typical “male” pelvis, free from the function of child-bearing, is built 
strong, high, close-knit, thick-boned, with a small cavity, the type best adapted 
to bearing weight. The typical “female” pelvis is, on the other hand, of lighter 
build, lower, more open, and with a roomy cavity, better adapted to child-bearing. 
The newly-acquired function of weight-bearing, however, tends to mould the 
female pelvis toward the male type, while the child-bearing function resists such 
changes in so far as they tend to interfere with its long-established and most 
vital function of child-bearing. To see how nature arbitrates between these 
opposing forces, in other words how with this change the function of child-bearing 
is preserved, was one of the reasons for selecting certain measurements. I have 
aiso included those diameters and indices with which to compare various groups 
and races, 


Modern pelvimetry, as employed in obstetrics, regards the “obstetric con- 
jugate,” * the shortest distance between promontory and pubic symphysis, as the 
chief factor of pelvic efficiency. Secondary in importance to this diameter and corre- 
lated with it, is the breadth of the pelvic “inlet,” as shown by the greatest trans- 
verse diameter. A comparison of these two dimensions indicates roughly the shape 
of this space. Lastly, it has been recently emphasized that the pelvic “outlet ” 
is a not infrequent cause of difficulty in labour and, if contracted, its efficiency 
may be gauged by the inter-tuberal diameter correlated with the space posterior 
to it. The more exact estimate of these two spaces, so essential to child-birth, 
and the means of calculating them in the living, are the direct objects of certain 
of the measurements taken. 


Inlet. In life direct mensuration of the “ obstetric conjugate” (Diagram I, A—R, 
and Plate II) or the conjugata vera cannot be satisfactorily performed. Calcula- 
tions in life from the “ external conjugate,” the distance from the tip of the spinous 
process of the last lumbar vertebra behind to the top of the pubic symphysis in 
front, as introduced by Baudeloque (5), though often suggestive are admittedly 
unreliable. We have, however, a much more reliable measure for calculation of 
this vera, which is the oblique or diagonal conjugate (Diagram I, A—P), extending 
from the promontory of the sacrum to the lower border of the symphysis, as 

* The ‘‘conjugata vera” of the text-books is taken from the promontory of the sacrum to the top of 


the pubic symphysis and is a variable amount longer than the ‘‘ obstetric conjugate,” consequently I 
prefer this shorter measure though both terms are used synonymously. 
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measured in life without difficulty in small pelves by means of two fingers through 
the vagina. Tocalculate the vera after obtaining the diagonal conjugate the inner 
surface of the pubic symphysis is palpated and two points observed ; first, the height 
of the point nearest the promontory above the symphysis (Diagram I, R—P); 
second, the angle which the line on which these two points he makes with the 
diagonal conjugate (Diagram I, R—P—-A). From these data we may calculate 
in each case the amount to be subtracted from the diagonal conjugate to give 
us the obstetric conjugate or vera (Diagram I, dP -NP=AR). This difference 
(VP) was calculated in the dried specimens, and found to vary from 0°8 cm. to 
3°2 cms. in my series. 

The following table gives the data used in the living for estimating the 
“obstetric diameter” or the conjugata vera. The pubic height was taken from 
the point on the pubic symphysis nearest the promontory, as an upper limit, to 
the point on the pubic symphysis nearest the tip of the sacrum, as the lower limit, 
which space may be called the length of pubic resistance (see Plate VI, 6 and 
Diagram I, R—P). 


Oblique Diameter | 5 Obstetric Diameter = Ditference 2 Pubic Height 2 
= fy ey ry 
9°4 to 9°9 cms. 4 75 to 8'°9cms. | 14 | 0°8 to 1:4 cms. | 46 | 1*1 to 1:9 cms. 9 
10 tol09 ,, 17 9 ton 99, 60 | 15tol9 ,, 98/2 to29 ,, 109 
Il toll9 ,, TAN HOS to lOs9), Tay | PAY oy Pa 58) | 3 to43 5, 95 
12) to 12°9 ,, To | lal tonlie9 54 | 2:5 to 32 ,, 14 broken 4 
13 to 139 ,, 38 | 12 tol12°9 ,, 10 — — — — 
14 tol49 ,, 10 | 13 to 14 . 4 — — —- - 
Average 11°78 cms. 10°68 cms. 1°76 cms 2°8 cms 
Maximum 14°9 __—,, 14 x Bi 43° ,, 
Minimum 9°4 _,, oD! 55 (Ot) Hie ess 


The antero-posterior distance being thus estimated, we turn to the measure 
of the width of the inlet, the transverse diameter. This distance cannot be 
satisfactorily measured in the living. Estimates of this diameter made in the 
usual way from the maximum inter-cristal and inter-spinous diameters are suggestive 
but unreliable, as demonstrated by Scheffer (6), who reported a difference of 
33 cms. in the inter-cristal measures of two pelves with equal transverse diameters. 
My series contains two rather narrow pelves (#62 and #200), the comparison of 
which bears on this point; one a generally contracted or “pygmy” pelvis, the 
other simply narrow. Three pelves, moreover, with nearly equally broad inlets, 
show wide variations in their inter-cristal diameters :— 


l 

| 762 crests 20°8 cms.| spines 17°2 transverse 10°7 | obstetric diameter 8°5 

$200 ” 23 ” ” 19°5 ” 10°3 ” ” 11 

© 2304, 25°, | 4, BBR » 14 : » 98 
ie a oe OO = 25 _ Lael “ . 12 

Weatdas, 4 Bos gl, | 22 ns 14 3 5 9°2 
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It may be seen from this table that #200 with inter-cristal diameter over 
2 cms. broader than #62, yet has a narrower transverse diameter. Again #112, 
with a transverse diameter practically identical with #30 and #143, has an inter- 
cristal diameter 3°6 cms. greater. By these extreme differences we are forced to 
conclude that accurate estimates of the transverse diameter of the inlet cannot 
be deduced from inter-cristal and inter-spinous diameters. 


Since we cannot measure satisfactorily the transverse diameter in the living, 
we are therefore forced to adopt for obstetric use some such rough rule as 
Williams (7) gives as follows :—* Despite many inaccuracies, the external measure- 
ments are of considerable value, in that they serve to indicate with tolerable 
certainty the variety of pelves (contracted) with which one has to deal. Normally 
the distance between the spines is 25 to 3 cms. less than between the crests: 
but in rhachitic pelves, owing to the flaring of the iliac bones, this proportion 
becomes deranged, and the two measurements approximate one another in length, 
the former frequently being equal to, and occasionally exceeding, the latter. If, 
however, both measurements are considerably below the normal, but preserve their 
usual relation to one another, and at the same time the external conjugate is 
also shortened proportionately, it is permissible to conclude that the entire 
pelvis measures below normal in all its diameters, or, in other words, is generally 
contracted.” The following summarising table of these measures may make the 
results of this series more graphic :— 


ot : a o 
Inter-crisial 2 Inter-spinous 2 | Transverse of Inlet | 2% 
Py a | Fy 
| | 
20°8 to 21‘9 cms. | 2 17°Z to 18°9 cms. 2 | 10°3 to 10°9 cms. 2 
22 to23-4 ,, | 10 19 to 20°9 ,, PAY || SME ato) IES) — 14 
23°5 to 24°9 ,, | 42 91 to22-4 ,, 67 | 12 tol12°9 ,, 71 
25 two26'4 ,, | 88 22°5 to 23°9- ,, 67 | 18 to13°9 ,, 108 
26:5 to 27°9 _ ,, 59 | 24 +0254 ,, 44|14 tol47 ,, 22 
28 to 2971 ,, 16 | 25°5 to 27 3 9 — — 
| 
Average 25°76 cms.) 22°66 cms, | difference 3:1 cms. 12°95 cms. 
Maximum 291, | 27 ~ of Cxiow, Wr gy 
Minimum 20°8 ,, i722 ss = 1 5 10°3 


While the average difference between the inter-cristal and the inter-spinous was 
3:1 cms., the variation between the extremes (1 to 6°75 ems.) of this series free 
from disease is wide and would tend to indicate that little reliance could be 
placed on deductions from such a comparison. 


obstetric conjugate (vera) x 100 
transverse diameter 


Superior strait Index = 5 


average 79°5, maximum 107°7, minimum 6155. 


These data were considered sufficient to give as accurate a measure of the inlet 
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as possible. For simplification the oblique diameters of the inlet, which cannot 
be measured on the living, were not included. 


Outlet. The pelvic outlet is commonly of very secondary importance as 
compared to the inlet. The space is a complicated one of irregular outline, 
and has received much less attention for these reasons. Occasional extreme 
contractions of this space have been recorded, usually, however, only after 
disastrous results of labour have occurred. The subject has lately been thoroughly 
reviewed historically from an obstetrical standpoint by Williams(8), who gives 
also his clinical observations on 1200 women. In analyzing this complex space 
a few special measurements were taken in the hope of rendering more simple 
and more exact the observations necessary for its estimation (see Plates III, VI, 7, 


VIL, 8, 9, 10, and Diagrams I, II, and ITI). 


Dracram I (traced from Plate VI, 7, pelvis #106). 


Diameters of the Inlet. Planes of the Outlet. 


A. Promontory. 
R. Nearest point of pubic symphysis to promontory. 
A—R. Inlet, obstetric diameter, or vera. 
12 Nearest point of pubic symphysis to sacrum, 
R—P. ‘Height of Pubic Resistance.” 
A—P. Diagonal conjugate. 
N—R. Perpendicular from diagonal conjugate to obstetric conjugate. 
N--P. Amount subtracted from diagonal conjugate to obtain obstetric conjugate. 
S. Tip of sacrum. 
T.  Tuberosity, point of impingement. 
S—P. Antero-posterior diameter of the outlet. 
O—T. Perpendicular from antero-posterior diameter of outlet to inter-tuberal diameter. 
S—T. Posterior sagittal diameter. 
P—O. Pubic symphysis to foot of perpendicular. 
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Pubic symphysis 


Maxi ; ae \ , Maximum 
_Maximum  } /Average inter-tubers = 9°79 cms. | inter-tubers 
inter-tubers * I a FE AEE Fy »{%= 12°75 cms. 
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‘e Minimum inter-tubers = 8 cms. ‘ 


Minimum 


Maximum 
Ant-post =} A?\ 


14°66 cms. 


Diagram II. Outlet. 


Circles represent the average fetal head circumference, 9:5 cms. 
Average inter-tuberal diameter 
Minimum 35 » ttt ; 
Maximum 5 a5 indicated. 


This diagram shows the effect of the narrowing of the inter-tuberal diameter. 


A contracted outlet, as is clearly pictured in Williams’ article, means a 
narrowing of the space between the tuberosities, accompanied usually by the 
so-called “male arch,” combined with a short antero-posterior diameter. The 
obstruction caused by this narrowing may be greatly increased by a forward 
position of the tip of the sacrum, or may be decreased by a backward position 
of that prominence (see Diagram III). The determination of the. inter-tuberal 
diameter is, therefore, of prime importance, that of the “posterior-sagittal ” 
(Diagram I, S-—T7'), the distance from the tip of the sacrum to the inter-tuberal 
line, is secondary to it and correlated with it. 


One of the first difficulties met with in estimating the outlet was the 
determination of the points on the tuberosities from which to measure. Owing 
to the soft parts this difficulty is even greater in the living. While in some 
pelves the angle of the tuberosities was such that the points could easily be 
selected, in others the rounded and complex curves made this selection a mere 
guess. ‘To overcome this difficulty and to obtain consistent results, a circular 
sheet of transparent celluloid was marked with concentric circles 0°5 centimetre 
apart (Plate VII, 8). This sheet was applied to the inner edge of the tip of the 


sacrum, as one point, and to the two tuberosities as the other essential points 
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of resistance (cf. Diagrams II and III). The circle which just touched these 
three points was read off as representing the size of the fetal head which might 
pass that particular outlet, and the points of contact were marked as the “tuber- 
osities.” Often the circle was tangent for a short distance to the tuberosities. 
The middle of this line of tangency was taken as the point of maximum resistance. 


Pubic symphysis 


ra Inter-tubers = 8 cms. *>, 
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Diacram III. Outlet. 


Circles represent the average fetal head circumference, 9°5 cms. 

Inter-tuberal diameter of 8 cms. requires a posterior-sagittal, 7°3 cms. or more ———— . 
Normal sacral movements lengthen the posterior-sagittal, 1:5 ems. - - - - - - é 

Reduction of the posterior-sagittal to minimum, 6 cms. -—-—--—--— : 


The antero-posterior diameter of the outlet (Diagram I, P—S) was measured 
from the nearest point of the inner surface of the pubic symphysis to the tip of the 
sacrum. The normal rotation of the sacrum on its axis, usually the second sacral 
segment, may during parturition lengthen this diameter, as well as the posterior 
sagittal, a variable amount, from 15 to 2 ems.(9). Such lengthening would allow 
the head to pass the tuberosities more posteriorly ; this in turn would bring the 
points of resistance on the tuberosities a variable distance back on those curved bony 
prominences, and, depending on this curve, the points of impingement of the 
head would fall a variable distance further out. Consequently we see that the 
moving backward of the tip of the sacrum for this circular passenger enlarges 
the available space not merely directly in proportion to the distance backward, 
but more nearly by the square of that distance. The figures of the measurements 
taken do not include such increase in the outlet space from the mobility of the 
sacrum (Diagrams I and II). 


Biometrika 1x 
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The following table gives the figures of the outlet in convenient condensed 
form :— 
wae o can a : F o 
Arch (pubic) Antero-posterior | © Inter-tuberal 2 | Posterior-sagittal | 2 
Fy aa Fy 
— ‘7 a \ 
Roomy eoeunlal 9- 9°9 cms. 3 | 8 89 cms. | 40 6— 6°9 cms. 42 
Medium-roomy 4 10-10°9 ,, | 48°) (9='9:9 5, I 73 i= oe 102 
Mediu 9 92) Io t 89 | 10-109. 5, | 85°) "8= 8:Onue 62 
Medium-narrow 5 | 12-12°9_,, 58 | 11-119 _,, 16 o> 9:0 9 
Narrow w. 44 13-14°66 ,, 19 | 12-12-75 ,, 3 10-17%, 2 
Average 11°59 cms. 9°79 cms 7°56 cms 
Maximum 14°66 ,, 12°75 ,, Lee. 
Minimum 9 % 8 35 6 55 


Small outlets, specimens in which the “fetal head” scale measured 9°5 to 
9 cms. in diameter, numbered twenty. As may be seen in the diagrams, the 
backward movement of the tip of the sacrum to the normal amount would 
enlarge these outlets sufficiently to allow the easy passage of the average fetal 
head*, and can thus be considered efficient pelvic outlets. 


The ischial spines, commonly the point of narrowest diameter passed by the 
fetal head in its descent through a normal pelvis, may well be an important factor 
in the mechanism of labour, In my series the attempt was at first made to 
measure this diameter, but in so many specimens it was found that this prominence 
was broken off in part or entirely, that anything like accuracy was impossible. It 
was decided, therefore, that any figures so obtained would be deceiving rather 
than helpful, and none are given. 


An index of the pelvic outlet was estimated as follows :— ° 


inter-tuberal diameter x 100 
antero-posterior of outlet * 


Outlet Index = 


average 8426, maximum 1089, minimum 64. 


The two diameters used in this index, however, lie in different planes, and can, 
therefore, hardly be considered a true measure of this irregular exit space 
(cf. Plate VI, 7 and Diagram I, S—P and S—T). The distance between these 
two diverging planes was estimated at the point opposite the tuberosities by erecting 
a perpendicular (O—T’) with the antero-posterior diameter (?—S), as a base line, 
to the inter-tuberosities line (7). The distance from the pubes along the base line 
to this perpendicular was also measured to show how far back the tuberosities 
were placed (Diagram I, P—O). The last of these distances (P—0O) shows approxi- 
mately how far posterior to the pubes the broadest part of the fetal head may 
be forced to pass, when the tuberosities are sufficiently narrowed. The first 


* Biparietal diameter 94 cms., sub-occipito-bregmatic 93 cms., Farabeuf and Vernier. 


A. B. Emmons 43 


measure (O—T7’) represents the amount to which the birth canal is prolonged 
downward and backward, when the tuberosities are thus narrowed. 


In recapitulating our findings in respect to the outlet it is fair to say, first 
that this is a complicated space, the points of obstruction being on different and 
diverging planes, varying in divergency, thus multiplying the number of elements 
in the problem. Some of these factors I have tried to elucidate, but others, 
including the ischial spines and the variations in size, shape, and malleability of 
the fetal head, must be left for further studies on the living, and for nature’s final 
test of labour. 


By means of the “fetal head” scale the points of impingement on the tuber- 
osities have been determined with comparative accuracy on this series of dry bones. 
These points on the tuberosities were found usually to be on the inner lip of the 
tuberosity at a varying distance from the symphysis. This variation was con- 
siderable and depended on three factors, the curve of these bones, their distance 
from each other, and their distance from the sacrum. By this means it was 
also found that some pelves which appeared to have small generally contracted 
outlets were in reality quite passable. 


The ischial spines, though not available here, must be considered as probably 
an important factor in the mechanism of labour at or near the outlet. 


Finally, emphasis is laid on the importance of the space between the tuber- 
osities, and, when this is reduced, on the available space behind, measured by the 
posterior-sagittal diameter. This combination is probably the most accurate 
practical measure to show the significance of the variations of the outlet. 


The separate bones of the pelvis were measured. The length of the cnnominate 
bone was obtained by means of the graduated measuring board and block, thus 
giving the maximum distance from crest to tuberosity. The width of the ilium was 
measured with the “compas glissiere” from the anterior to the posterior superior 


breadth x 1 00 


spine. An index for the larger innominate was figured = — haghi ae Also a 
reigh 
5 


Mee height (highest innominate) x 100 
oe breadth (inter-crests) 


The sacrum owing to its position of importance and to its great variability 
is a most interesting bone. Its shape often modifies markedly the pelvic cavity. 
Its height was taken from the middle of the anterior surface of the promontory 
to the anterior surface of the tip of the sacrum. The maximum breadth was 
taken with the instrument parallel to the anterior surface of the bone. 


breadth x 100 


Sacral index = 
height 


Four “observations” were added. 1. The number of sacral segments. 2. The 
sacral curvature was estimated as “slight,” “moderate,” or “pronounced.” 3. The 
segment noted at which the curve began. 4, False promontories were noted and 
used for the measure where the cavity was involved in the measurement. 
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Summary. The features brought out by this investigation are: 1. The 
variation in the size, shape, and type of the female American Indian pelvis as a 
whole. 2. The variation in the pelvic inlet and 3. outlet. 4. The variation in 
the number of sacral vertebrae. 5. The frequency of a false promontory. 


1. The variation in size of the normal female pelvis is considerable, as is 
illustrated in Plate I, 1. 


The shape of the pelvic cavity also varies much, as is suggested by Plate IT, 2, 
comparing a flat inlet with a rounded one; and Plate VII, 9 contrasting a wide 
outlet with a narrow one. 


The “male” type or high, narrow-arched pelvis is shown in Plate III, 3, com- 
pared with the “female” or extremely shallow, broad-angled pelvis. The “ male” 
type suggests a birth canal with a small bore, long cylindrical cavity ending in a 
narrow-arched small outlet. The effect on the mechanism of labour is to increase 
its difficulty proportionately. 

The determination of the sex of a pelvis is not always easy, and in a small 
percentage (perhaps 1 to 3°/,) of cases is next to impossible, even with the aid 
of the whole skeleton. The breadth of the great sacro-sciatic notch was found 
to be the most reliable guide. The acuteness of the sub-pubic angle was next in 
usefulness. The thickness of the bones and finally the skull and long bones 
were referred to in doubtful cases (10). No specimen was included unless deter- 
mined by these signs to be female. 


2. Considerable variation is seen in the diameters of the inlet. Michaelis (11) 
reported 1,000 cases carefully studied as to pelvimetry and the results of labour. 
Litzmann (12) continued the work, reporting a second 1,000 cases. The standard 
determined by these men has been accepted throughout the world. Litzmann (12) 
considers all pelves contracted if the conjugata vera is 10 cms. or less in a generally 
contracted pelvis, and 9°5 ems. or less in a flat pelvis. By this standard my series 
of Indian pelves show 63 “contracted pelves,” or 29 °/,, as follows :— 


| 
| Generally contracted 9 to 10 cms. | 6 | Flat 9 to 9°5 cms. | 43 
| 55 > 8to 89 ,, 1 5 8) tore: | 11 
| ee, 7 ortous:O ns alee 


A pelvis was classed “generally contracted” if the transverse diameter was 
less than 12 cms., at least one centimetre below the average: with, at the same 
time, an obstetric diameter of 10 cms. or less. A pelvis was considered “ flat” 
if the transverse diameter measured 12 cms. or more with the obstetric diameter 
9°5 ems. or less. 


Three only of the seven generally contracted pelves showed the inter-cristal 
and inter-spinous diameters reduced more than 1°5 cms. below the average, and 
thus might be said to “suggest” lateral contraction within normal limits. The 
other four showed no such suggestion of contraction. 
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The flat pelves, 56 in all, show that at least slight contraction of the inlet 
is to be expected in one pelvis in every four, while moderate to serious narrowing 
of this all-important space is to be expected about once in twenty pelves. 


3. The outlet showed wide variation in size and shape. By the use of the 
“fetal head” measure it was determined that no pelvis, in all probability, was 
too small to pass the average fetal head. It was found that a narrowing of the 
inter-tuberosities diameter was the most important single factor in reducing the 
size of the available space. Next in importance was the shortening of the space 
behind this line, or in other words a reduction of the posterior-sagittal diameter, 
found with but slightly greater frequency in pelves in which the sacrum contained 
an increased number of segments. The combination of these two factors, short 
inter-tuberosities and posterior-sagittal diameters, was essential to reduce seriously 
the efficiency of the outlet. The normal movement of the sacrum, allowing the 
tip to swing backward, enlarged the available space considerably. The diameter 
of the ischial spines, though probably of great importance in the efficiency of 
a pelvis, was not available in this series of ancient pelves. 


4. A numerical variation of the sacral vertebrae was noted in 47 pelves 
or 21°7 °/,, or one in every five pelves. The number of segments ranged from 
four to six (see Plate 1V). By an increased number of segments is meant sacra 
in which the six segments were all sacral in character, or those in which there 
was a transitional vertebra, whether lumbar or coccygeal in character. Such a 
classification, as was pointed out to me by the late Professor T. Dwight, is more 
practically useful than anatomically correct. Pelves with small outlets were 
found slightly more often among those with an increased number, and it is 
possible that this increase in segments may be one small factor in reducing the 
size of the outlet. Aside from the possibility of slightly infringing on the outlet 
space, in rare cases the numerical variation appears to play no important part in 
the variation in size and shape of the pelvic cavity (Plate VII, 9 and 10). 


5. False and double promontories were found in twenty pelves, about one 
in every eleven pelves. These false promontories varied from a marked prominence 
of the second sacral vertebra (Plate VI, 6), an equal prominence of the first and 
second sacral segments, to a projection of the top of the last lumbar vertebra 
beyond the sacrum nearer to the pubes (Plate V, 5). In nearly all cases false 
promontories were associated with transitional vertebrae or an increased number 
of sacral segments. The apex of the lumbo-sacral bend falls at a point pro- 
portionally distant, in all probability, from the sacro-iliac attachment or the 
“vertebra fulcralis” of Welcker (13), and in these transitional cases this distance 
brings the point on another vertebra than the usual one. False promontories 
occurred in four specimens classed as moderately generally contracted and in one 
with a small outlet, but in no other “contracted” pelvis. That is a little more 
often than the general average, but in all probability has no special significance. 
From the general appearances as well as from the measurements it seems fair 
to say that the false promontory has no appreciable effect on the pelvic cavity. 
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Congenital dislocation of the hip was found in one specimen (#10). The 
displacement, of the right femoral head only, was upward and backward where 
a new socket was formed on the ilium. The deformity produced a tilt to the 
pelvis of approximately 10 degrees downward to the right. A new acetabulum 
was formed, and the femoral head and neck showed practically no change. As 
will be seen by the measurements no alteration is found in the pelvic cavity in 
spite of this deformity. 


In conclusion I would emphasize that a “normal” standard for the pelvis 
should include not only an average, but also a series of measures graded from 
minimum to maximum. A large series would seem necessary in order to include 
the wide variations found among non-diseased pelves. Owing to these great 
differences found among the specimens of such a comprehensive series, less stress, 
I think, should in future be put by the anthropologist on those small differences 
seen in a few or even in a small series of pelves of separate tribes or races, but 
rather the close resemblances should be emphasized in contrast to the great 
differences noted between animals and man. And the changes might be further 
studied and traced which have followed the assumption of the erect attitude. 


Such changes as the differences in the sexes are receiving the attention of 
the embryologist (14). The anatomist is studying the variations of the spine 
and their significance. The obstetrician is constantly seeking more light on 
the interpretation of peculiarities of this complex bony structure. It was my 
desire, therefore, to suggest to the anthropologist an application which may be 
made of his work of collecting and classifying, identifying and interpreting such 
valuable material as is here used, and also to direct his attention to the applied 
anatomy and physiology of the pelvis: that is the function of weight-bearing 
and child-bearing. 


The popular idea has been that the function of child-bearing among the 
American Indians was always efficient and easy. Engleman (15) confirms this 
idea by saying that labour as a rule among North American Indians is short 
and easy, averaging two hours. As civilization is approached, however, labour 
becomes more extended. Thus half-breeds, as the modern Mexican Indian, 
average three to four hours. He further states that accidents during labour 
are rare when women do not marry out of their tribe, for the child’s head is 
in proportion to the pelvis. But deviation from the natural state, he continues, 
brings difficulty. The example is given of the Umpqua tribe, who have inter- 
married with whites and have died, it is stated, during labour in consequence 
of the disproportion between the larger head and the ordinary pelvis. When 
the father of the child, however, was also an Umpqua no such trouble is known 
to have occurred. 


My series of dried specimens bears out in general this clinical evidence, for 
most labours might well have been short and easy so far as the pelvis was 
concerned. About one-fourth, however, would require a rather smaller fetal 


Plate | 


Biometrika, Vol. IX, Parts | and Il 


‘xnoig ‘JOT B ‘adreq—"euoziay “Zo F ‘Treug—eztg Ul UOTVNeA 


‘T 


Plate Il 


Biometrika, Vol. IX, Parts | and II 


oyppurmog ‘en 2 “uyq—'Ayonquey ‘9g 2 ‘poepunoy—edeyg ur uo, 


%G 


Plate III 


Biometrika, Vol. |X, Parts | and Il 


“29 Ured ‘GE 


) 


Cad 


- 


.co[eula,y,,—"aenbs ‘aqovdy ‘of Z .SoTey.,—‘aday, ur vores, 


Plate IV 


Parts | and Il 


Biometrika, Vol. IX, 


‘squouises 9 ‘66% ‘8 ‘sjueuises g ‘Fx S "L  ‘S}UaUIHES g ‘46% ‘9 ‘syuautdes fe ‘Q9TZ -e 


te Qnqa Ba fp ~ “zr ea. Qna a aad °a 7a 1 & a od fs hee " . 7 Safe ‘ x 
sjuautses Se ‘goTt ‘fF ‘sqyuewBes Gc ‘yg ‘gE ‘syuembdes Tp “ICD SG squowmses ¢ ‘092 'T SAMI AAISSAIBOL “sPUOUIFIG [VAMVG JO SUOTPVLIVA [ROLOUIUN fF 


é 


ay 


N 


Biometrika, Vol. IX, Parts | and Il Plate V 


5, ‘False Promontory” made by first lumbar vertebra. 277. Apparatus for holding pelves (Author), 
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8. Celluloid circular scale for Outlet, dark circle diameter 9 cms., string 
marks tuberosities. Pelvis £15 California, 13,232, six sacral vertebrae. 


9. Variation of Outlet. String marks intertuberal diameter. Wide: $13, California 13,554, Intertuberal 12°75, 
Antero-posterior 14:66 cms. Narrow: $34, Massachusetts, 47,998, Intertuberal 8, Antero-posterior 12 cms. 


10. Six sacral vertebrae, all sacral in character. Roomy Outlet. Tuberosities marked. 
10-lems, Antero-posterior 1:75 ems, California, 13,232 (same as photo. 8), 
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head than the average, as found in civilized countries and peoples, in order for 
labour to be short and easy. About one in ten would require a small and 
perhaps malleable head to make successful labour possible. Observations on 
the heads of new-born pure-blooded Indian infants as to size and malleability 
would be a welcome supplement to our present knowledge of pelvic efficiency. 


(1) 


(2) 


(3) 
(4) 
(5) 
(6) 
(7) 
(8) 


(9) 
(10) 


(1) 
(12) 


(13) 
(14) 


(15) 
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48 A Study of the Variations in the Female Pelvis 


Table of Pelvic Measurements of 


INLet PELvic 
Peabody | Tuter: — ae ss 4 
No. | Museum | Tribe or ryests | Lnter- a Sig ea® S apa) a oe i) 
No. Locality wae: spines| 2 2.5 i: Se 9g x 18.8 = es 4 23 ae 
cms. n = Gg | o 5 3 ns} Xo g = rd Va vs 
a |28)28 | 8. | 4 |S see 3 | 82 |e 
| H O88 2° a <q 24 i) ~ ay || aa 
| i= aXe) ey q S) 
| =] Ay 
dt |) MILKS} 7A0) Tennessee 25°5 | 23°25) 1371 | 12-7 | 11 Se 84 12°5 | 10°75) 9 73 | 7 
2 | 57,506 Ohio | 27°25 | 25°25) 13°5 | 12°5 | 11°25; 1:25) 81 10°8 | 10°5 9 83 | 4:5 
3 | 26,630 2» | 26 23°75) 12°56 | 13°25) 11°75 | 15 94 11 9°5 81 68 | 5:3 
4 | 26,989 7 26°5 | 23°75 | 12°75 | 11:3 9°5 18 745) 11:6 | 12 10°3 83 | 4:5 
5 | 13,551 California 24 20s 13 12°2 | 11:2 il 86:2] 10°6 | 11 9°2 77 14 
6 | 13,238 | ” 93°75.) 20 = W1n-75 | 1175 10775) a 91-4} 11°15 | 11 10 8 . | 4:2 
7 | 13,240 5 27 24°5 | 14 11°4 | 10°1 1°3 72 ll 11 10°1 77 | 4:2 
S | 13,239 26°25 | 22°75 | 13°25 | 12°3 | 10°6 WEF 80 11 9°5 9 Cia foes 
9 | 13,286 - 25°5 | 20°7 | 13°25] 12°3 | 11 1°3 83 10°8 | 10 8°85) 74 | 4:5 
10 | 13,448 ” | 25 22, 13 12°1 | 10°4 1°7 80 10°9 | Il 9°9 76 | 4:5 
11 | 57,883 . 27°5 | 25 13°25 | 11°5 9°5 2 711 | 18°75 | 12°56 | 10 105 | 4:5 
12 | 13,553 ol 25°5 | 20°56 | 13°75}|11°5 | 10°2 1°3 74:2) 13 10 grt 75 | 4°85 
3 | 13,554 +) 26°25 | 21 14°25/ 115 | 9°52 | 2 66°6 | 14°66 | 13 12°75 | 8:24] 36 
14 | 12,248 | % 24 22 12 TOR LOLS 2 90°8 | 13 12 10°35; 8 6°2 
15 | 13,232 ) 26°75 | 22°5 {13°56 | 11°5 | 9:35) 2°15] 69:3) 11°75] 10:5 | 10:1 71 | 6:15 
16 ) 13,545 - 24°75 | 22 12°75 | 12°5 | 10°5 2 82°3 | 11 10°25 | 9:9 G 5 
17 | 13,546 7 Pape 24°5® | 13 14°7 | 13 Lea LOO 12°75 | 11 10°2 75 | 56 
IS | 13,547 | "5 25°5  } 23 13 1273) | Te 13 84°6 | 11°25 | 11 10°1 78 | 4:6 
19 | 13,548 | 26°25 | 23 Sid) 22. ea 1:2 81°4 | 12 10 9 7°75 | 6 
20 | 48,009 Mexico 24°75 | 21 13°99 | 11 9:8 1:2 70°5 | 12°25 | 10°5 9°5 73 | 6:25 
21 | 57,7822 | California 27 24°25 | 14 12-2 | 10 2°2 71°4 | 10°75 | 12 Lil? 72 | 56 
22 | 58,143 Iroquois, N. Y. | 27 25°5 | 13°25 | 12°4 | 10°2 2°2 73°6| 9 10 9°8 75 | 4°65 
23 8,359? | Flat Head 27°25 | 25°56 | 14:25 | 12 10°25 | 1°75} 71°9} 1171 | 11 10°3 66 | 5:5 
24 | 57,458 New York 25°25 | 23°56 | 125 | 11°7% | 9°52 | 2:2 | \76 12 11 9°1 81 | 51 
25 2,347 IKentucky 25°25 | 22 13 12 10°5 15 80°8 | 12 11°5 | 10°9 8:2 | 51 
26 2,346 | 7 28°5 | 26°25 | 14 13°2 | 11°5 1°7 82°1 | 13 10°5 9°5 77 | 65 
a7 2,350, II] Pa 27°5 | 23 13°56 | 12°25] 10°75 | 1°5 79°6| 11-7 | 10°5 8°8 8:2 | 4:6 
28 | 11,972 Tennessee 23°5 | 22 13 13°5 | 12 1°5 92°3 | 11°75 | 11°5 | 10°2 86 | 4 
29 | 11,861 ¥ 27 24% 14 12°6 | 11 1°6 78°6 | 11 10°75 | 10°1 78 | 4 
30 | 57,512 . | 25°5 | 23°25 | 14 11°5 9°8 IS 7/ 70 12°25 | 12 10°37 | 88 | 4°6 
OL | 27,213 or | 26 24°25 | 13 11°56 | 10°25] 1°25] 78:8) 10°25 | 10 9°7 a 4°15 
32 | 32,435 Massachusetts 26 23°25%| 13 11°5 | 10 1°5 77 11-2 | 10 8°8 6°5* | 5-7 
33 | 10,262 " 23 20 12 12°5 |.11 15 83°3 | 118 9 S'1 6 6°42 
34 | 47,998 % 28°5 | 27 13°56 | 1371 | 11 2°1 80°4 | 12 10 & & 6-1 
39 | 57,779 Arizona | 25 23°3 | 13 9°8 7-75 | 2°05} 59°6} 11 10°5 | 10 65 | 5-4 
36 | 58,005 Ohio 25 22°5 | 12 12 11 | 1 83°3 | 12 12°5 | 10°8 95 | 4 
37 | 58,022 y 27°35 | 25% |12 | 148 | 14 0:8 | 107-7] 13' ~ 119°) “/5t0:3048ant5-5 
3S | 58,049 6 | 26 24 13 11-7 @) 2°7 69°2|12°5 | 11 10°42 | 6°54" | 78 
39 | 58,056 cf 2 24°25" 12°75 | 13°2 | 11°5 17 90°2 | 11°25 | 11 9°9 8 4-7 
40 | 58,023 . 2625/23 |13 |141 |13 11/100 /12 |12 /12 7:89 16 
AL | 58,057 om 23? 20°75% 12 13°2 | 10°25 | 2°95] 85°4/10°25| 9°75 | 8°8 76 | 4°5 
42 | 58,453 5 25°75 | 23°5 | 13°5 | 11°2 | 10 1:2 74 12 11°5 | 10°2 8 5°5 
43 | 58,463 5 28 25 14°5 | 12°3 | 10°5 1°8 72°4|12°3 | 12 10°6 9 5 
U.S. N.M | 
44 227,463 | Alaska 2351) 20:5) | 11-2 a2, 9°7 2°3 86°6 | 10°3 9 10°6 6 5°4 
45 225,473 | Apache 25°75 | 23°8 | 13:9 | 12°7 | 10-2 2°5 75°5 | 12°2 | 11:5 | 10-7 8:7 |6 
J6| 228361 | 2 25°5 |92 |11°5 |11-9 |10-2 | 1-7 | 88-7/10-2 | 9:5 | 8-5*| 7-2 | 3-9 
47 226,290 Arizona 26°5 | 23 13°8 | 11°8 9°5 2°3 69°5 | 12°2 9°75 | 84 71 | 6:3 
1. One sacral segment probably from coccygeal end. 3. Last lumbar articulates with left 
sacral ala. 5. Four sacral segments. No evidence of more at either end. 6. First sacral tran- 
sitional. False promontory. 7. False promontory. 10. Congenital dislocation of right hip. Two 
sockets. Inlet diagonals, right 12°3, left 12°3. 1l. Coccyx ossified to sacrum (not included), distorted. 
12. False promontory. 13. Wide outlet (Plate VII, 9). 15. Well proportioned symmetrical sacrum 
(Plate VII, 8 and 10). 17. Coccyx ossified to sacrum (not included). 20. Tapering sacrum. 


(Note: ‘‘a” means an approximate measurement often due to a broken bone. Italics signify high or low 
value or some peculiarity referred to in text.) 
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American Indian Squaws. 
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OvuTLET InNoMINATE Bony SAckuM 
a . Sige : 
w ES} ] 
3| 4 é | | q | | 
S| > | :3 = DAS |S ee eee re a |p sl 3 + 4 | 88] 23 | 
o}| 2 2 2 te of 48 | BH! gH! gs Caen oS 4 B fp | > | S| 8 
ewe |e ee ae Pe eR a lee) les) 
Ay Ay ° | | | 
eS} Medium} 80 |19°8 14 | 70°8) 19°8 | 14 — | 77-6 | 10 11°5 6 | Moderate Ist 115 
3°3 M 83-3 | 20°6| 15-2 | — | 21:1] 15°6 | 73:9] 77-4] 11:1 | 12-4 3) M_, 38rd | 112°6 
‘3/29 | Narrow] 73°6/ 19-2) 13°8 | — |19-4/13-7 | 706) 74-6/11°6 | 111 | 5 | Slight | Ist | 95-7 
5 |3'4 | Roomy] 88:6] 19°9| 15-4 | 77°4| 19°9 | 15-4 — |751} 99 | 11:4 3) M 1 115-1 
7 | 2:3 M 86°8 | 18°5 | 14 — | 185/143 | 77-3] 78-7] 68 | 11 4 (Pronounced 1 1746 
6 | 4:3 R 89°6 | 18°3 | 14°5 — | 186) 14°4 | 77:4) 78-3) 10°5 | 10°5 5 M | 2 100 
2°4 R 91°8}19 | 14°75; — | 19°2/ 14°65 | 77-6/ 71:1} 10°95/ 12-6 | 5 S | 4 115°1 
3°5 N 81°8 | 20°4 | 15°54 | 76 20°4* 15°35" | — | 73-9] 1071 | 1225] 5 S 3 | 121°2 
3'4 N 81:8 | 18°8 | 148 —— eles8 lo US )483 || (eine || ees ker 5) S 1 124-7 
6 |265| R 90°8 | 19:7] 13°5 | 68-°5/ 19-4} 138 | — | 74-4] 93 | 11:9 | 5 S 1 | 127°9 
oN Ae M 75 19°9 | 16 80'4 | 199) 15°6 — |72°4;} 86 | 10°95! 5 Ie 1 127°3 
75 | 2°22 M 82°5/19 | 14:4 | 75:8} 19 13°75 74:1 | 1025) 11-1 | 5 SS) 4 108°3 
2 R 87 18 14°52 | — | 18°6 | 14°52 | 72°6| 70-9} 9-1 | 12°2 9) M 2 | 134-1 
75/3 M 79:6 | 1971 | 14°88 | — | 19°8| 14°5 | 73-2} 82-5) 98 | 11°9 5 M 3 121°4 
25 | 2°8 M-R 85-9) 19°1| 146 | 76°5 | 19-1 | 14-4 — |73°6|10°6 | 12 6 M 1 113°2 
3°25 R 90 19 15°5* | 763} 19 | 14°5 — |76°8|} 9°65] 11°75} 5 M 1 121°7 
275| M so |20 | 15-2 | — | 205 | 15-2 174-1] 75-9/11°3 | 19-2 | 5 M 1 | 107°9 
2°5 M 89:7 | 18°6 | 14°4 | 77-4 | 18°64 14-42 | — | 72-9] 8:1 | 12°3 5 M 1 151°8 
cD | 2-7 M 75 18°4 | 14°6 Se Ce |e 77 71:2) 84 | 12°3 5) M 1 146°4 
25 | 2°35 M 77°5 | 19°2| 14°8 | 77:1 | 19°2 | 14°6 — |776] 91 | 111 5 iP i 121-9 
2°7 R 108°8 | 19°8| 14-4 | 72°7 | 19-7 | 14:5 — |73°3/ 96 | 12°6 5 M 1 131°2 
26 | 38 M 108°9 | 19°5 | 14:24 | 72°8 | 19°54) 14-2 — | 72°2|106 | 12°5 5 S 2 nya) 
3 M SCN ees) 15 — |19°2/ 15 TSal | 7025) 5 eO298 LZ eA 9) M 1 135°9 
6 | 3°62 M 75°8 | 18°6 | 13°7 — | 19:1) 14:1 | 73:8] 75-6|10°5 | 10°85] 5 M 1 103°3 
5 (2 R 90°8 | 17°8 | 14 | 78°7 | 17°8| 14 — |70°| 99 | 10°8 5) S 3 | 109:1 
31 M 73°1 | 20°8 | 15°5 — | 21°2]15°5 | 72:1) 74:4) 11°6 | 12°5 5 bs) 2 107°7 
75 | 2-9 M 75°2| 19°6 | 15°3 | 78:1 | 19°6 | 15°2 — eles Or a alled 3) M 1 141-6 
40} 1) 216) M 86°8 | 19-1 | 14 - 19°7 | 13°7 | 69°5 | 83:8} 9°9 | 11-2 5 M 3 | 113+1 
5 | 3:3 R 91°8 | 21-1} 15°71 — | 21°4] 15:1 | 70-5 | 79-3 | 11°25 | 12°7 OD M ae 112°9 
2°8 R | 84:1] 19:4] 15-2 | 78°8| 19°4| 15-22 | — | 76:1| 12-9 | 11:1 6 S 5 86 
5 |3°5 M-R 94:5 | 18°4| 14°5 | 73:3) 18-4) 14-5 | — | 70°8 | 18-2» | 11°4 6 S 3 86°3 
5 | 2:9 N-M 78°6 | 19:2 | 13°88 | 71:3} 19 | 13°8 — | 73:8 | 108 ie7/ i) M ] 1 
5 | 34 N 73°6 | 18 12 a its} | 13°8 | 75°8 | 7971 | 10°6 | 10°3 5) S 1 Sil 
35 | 3-4 N 66°6 | 20°5 | 16 78 | 20°5 | 15:8 — |71°6/ 102 | 12:2 5) M 3 119°6 
dT |85) M 90°9 | 18-7 | 13°8 — | 188) 14°1 | 75 7522) 9°) 14 | M L 120 
2°6 R 90 18°6 | 15 — | 18°8/14°5 | '77-2/ 75:2] 11:1 | 10°9 5 S 1 98°2 
75 | 32 M 79°2 | 22-2 | 16°5 — | 22:5 | 16°5* | 73:3] 81:8] 11°1 | 12°6 9) iP 1 113°5 
5* | 3'1 M 84:9 | 19 13°5* | — | 19:2) 135 | 70-3] 73°8} 10°71 | 11:2 | 5/6 S 3 | 110°9 
5 | 2°8 M 88 20°2 | 15°2 — | 20°7 | 15:2 | 73:4) 76°7| 10:2 | 12-2 5 M 2 119°6 
2°6 R 100 21 15°5 — | 21°3) 15-6 | 73-2} 81:1] 11-71 | 12 5 M 3 108-1 
co | 259 N 85°8 | 18 132 72-2 | 18 | 132 — |78°2} 88 | 11:7 5 M | 4 | 1383 
ey || eASz M 85 19°8 | 15:4 | 77:8 | 19°8 | 15-2 — |76°9] 12:1 | 11°3 6 M 4 93°3 
5 | 33 R 86°2 | 21°3 | 16 | 75°1 | 21°2 | 16 — |761 > 11 13°7 a) | M 3 | 124°5 
Pe | | 
2°6 N 79°6 | 19°5 | 14:5 | 73°8 | 19°5 | 14°5 — | 84:4} 10°35) 10°25, 5 S | 2 | 99 
2°7 M 87272). 21°0 | 15:6 | — | 21-2 | 15°75 | 74°3.) 89:3. | 7:7 | 19-2 a) B eal 158-4 
5 | 35 N 83°3 | 2071 | 15 74:6 20 15°1 — |78°8|) 9°5 | 10°8 i) M 3 113°7 
7 | 3:0 N 68°9 | 19°4] 13°2 | 68 19°71} 13°7 | — | 73°2]10°9 | 11°45] 5 ) 4 | 105 
i i] | ! 
21. Ossification, left ilio-sacral synchondrosis. 26. Large pelvis, heavy bones. 30. Six sac. vert., 
one from coccyx(?). False promontory. 31. Six sac. vert., one from lumbar, False promontory. 


33. Small pelvis, coccyx ossified to sacrum. 34. Sacral angle acute (8rd vertebra) (Plate VII, 9). 
35. Narrow flat inlet. 37. Large pelvis. 38. Left sacral ala rises to 3°1 cms. above body and 
articulates with last lumbar. 42, Six sac. vert., one from lumbar(?). False promontory. 44. Small 
pelvis. ‘“ Male type.” 46. ‘Male type” (Plate III, 3). 47. ‘Male type.” 
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Table of Pelvic Measurements of 


American Indian Squaws. 


| INERT Recas OuTLer Tynominate Bone Sacrus 
Peabody ; Inter- | allt 3 2 a —_—— g == ee 
No. | Museum Tribe or = Inter-| 2 Bolles 8 ne] 3 Halos a{ 3 a 3 } | 
No, Tecate cre Vienincs| 2 | Seles || S| Bee el Nee ieeedlas sz|2| 32 a es ly a el | s.| 2 [#8 ed 
cms. & a) 22 5 roi cekore lie on 25 | 32 SS Sa 4 |43\ 5a] 3 |se] se] 8 3 4 eS) Wests 2 ean 
2 |28|ee | 8 | 4 |sesi 3 a | 82 |38 s| |) ee | 2 |SS|22\e 188/88 |/e)/e]8 | 8 |EeI 8 |g8| 3 
a SA 155 a RA| & B a ae 2 2 = = Ae] Be | Ss le & fe} a i] im Ze 8 ifP eI 
=a } | & S) | | 
Ee | } | 
97 SaeG a : 
2 57/506 Pathe ae 5 ie ee Medium | 80 19°8 | 14 70°8 | 19°8 14 = 34) 1K) 115 6 | Moderate! Ist | 115 
3 8 15 81 | 68 | 5-3 M 83:3 | 206/152 | — | 21-1) 166 | 73:9) 77-4) 111 | 124 | 5 M 3rd 1126 
4 Pa 18 103 83 | 45 Narrow] 73°6| 19:2 | 13°8 | — | 19-4) 13-7 | 70°6 | 74°6 116 | 111 5 Slight | Ist | 95:7 
5 California 1 9-2 77 «(14 Roomy | 88°6| 199) 154 77-4 19°9 | 15-4 =| 75:1 | 9:9 | 11-4 5 M | Ls} 1151 
6 » il 10 8. 42 M 86°8 | 18:5 | 14 18-5 | 14:3 | 77°3) 78-7) 63 | 1 4 |Pronouneed 1 1746 
7 0 13 101 | 7-7 | 42 R 89:6] 18:3) 145 | — | 186) 14-4 | 77-4) 78:3] 105 | 105 5 M | 2 | 100 
8 a. 1-7 9 67 15:8 R 918/19 | 14:75] — | 19-2 | 14-65 | 77-6 | 7171 | 10°95 | 12°6 5 iS 4) 1151 
9 ‘ 13 8:85] 7-4 | 4-5 N 81-8} 20-4] 15-5" | 76 | 20-49) 15-5" | — | 73-9] 1071 | 1225) 5 s 3 | 121-2 
| 40 i 1-7 99 | 76 | 45 N 81:8 | 18:8 | 14" — | 18°8| 15 79:8 | 737 | 97 | 121 | 5 s 1 | 124-7 
11 a 2 10 105 | 45 R 90°8 | 19:7 | 13°5 | 68-5 | 19-4 | 13°8 — | 74-4] 9:3) | 11:9 5 Ss 1 | 127-9 
12 ie 13 91 | 7:5 | 4:85 M 75 | 19-9] 16 80-4 | 19-9} 15:6 | — | 72-4] 86 |10°95| 5 P 1 | 1973 
3 - 2 12°75 | 8-24| 3-6 M 825/19 | 144 | 75°8 19 | 13°75 | — | 74-1 | 10-25) 11-1 5 s 4 | 1083 
14 ” 2 10:35) 8 62 R 87 |18 | 145") — | 186 | 14:5") 72°6| 70:9] 9-1 | 122 | 5 M 2 | 1341 
15 s 2-15 101 | 7-1 |6-15| M 79-6| 1971 | 14:8° | — | 19°8| 14-5 | 73-2 | 825} 9:8 | 11:9 5 -M 3 | 1214 
16 ” 1 2 9:9 7 5 M-R 85-9] 19°1| 14°6 | 76°5| 1971) 14-4 | — | 73:6} 10°6 | 12 6 M 1 | 113:2 
7 " 1 17 102 | 75 | 56 R 90 |19 | 155°) 763/19 | 145 76:8| 9°65] 11-75} 5 M 1 | 121-7 
18 | a i 13 101 | 7-8 | 4:6 M 80 |20 | 15-2 | — | 205 | 15-2") 74:1) 75-9) 113 | 122 | 5 M 1 | 107-9 
19 | x 1 1-2 9 77516 M 89:7 | 18-6 | 14:4 | 77-4] 18°64 14-49 | — | 72-9] B81 | 123 | 5 M 1 | 151-8 
20 | Mexico 1 1:2 95 | 7:3 | 625) M 75 |184/14:6 | — | 18:7] 14:4 |77 | 71:2) 8:4 | 12:3 5 M 1 | 146-4 
21 | 57,7822 California 1 22 11:7 | 7-2 | 56 M 77°5|19-2| 14:8 | 7771) 19°2)146 | — |77-6} 91 |lld | 5 P 1 | 121:9 
Troquois, N. Y. 1 Py) Pe 9:8 | 7:5 | 4°65 R 108-8 | 19°8| 14:4 | 72°7 | 19-7} 145 | — |73°3] 96 | 126 | 5 M 1 | 131-2 
Plat Head 1 1 10:25 | 1:75 10:3 | 66 | 5:5 M 108-9 | 19:5 | 14-28 | 728 | 19°54) 14-2 | — | 72-2] 10°6 | 12°5 5 s 2 | 117-9 
New York 11-7" | 9:58 | 2:28 91 | 8:1 |57 M 911/19 | 15 — |19:2)15 | 78:1] 70°5| 89 |121 | 5 M 1 | 135-9 
| Kentucky 12 : 15 10°99 | 8:2 | 5-1 M 76°8| 18-6) 13-7 | — | 19-1} 14-1 | 73°8/ 75:6) 10°5 | 10°85) 5 M 1 | 103:3 
2,346 * 13° 1-7 9:5 | 7-7 | 65 R 90:8 | 17:8 | 14 78°7 | 17°8 | 14 = |'70:5'| 9:9 || 10:8 |) 5 Ss 3 | 1091 
2,350, IL] » 1 1°5 83 | 8-2 | 46 M 1 | 20: — | 21-2) 16:3 | 721) 74-4) 11-6 | 125 | 5 s 2 | 107-7 
28 | 11,972 Tennessee 1 15 102 | 86 |4 M 75:2 | 19°6| 15-3 | 78:1] 19:6) 15-2 | — | 71-3] 84 | 119 | 5 M 1 | 141-6 
29 | 11,861 4 1 16 101 | 78 14 M 86:8 | 19-1 | 14 — | 19-7) 13-7 | 69:5] 83:8} 9:9 | 11-2 | 5 M 3 | 113-1 
ely » 1 17 10:32 | 8:8 | 46 R 91-8 | 21-1] 151 | — | 21-4) 15-18 | 70-5 | 793) 1125) 127 | 5 M 2 | 1129 
| 81 * 1: 125 97 17 | 416 R 84:1 8/194) 15:28} — | 761/129 |la | 6 | S | 5 | 86 
32 Massachusetts | 2 11: 15 8-8 | 6:59 | 5:7 M-R 94-5 18-4} 14:5 | — | 70-8 | 13:2 | 11:4 (x | 8 3 86°3 
33 ” 2 12. 15 si |6 |640 N-M | 786 TOV | 13:80} 73-8) 108) 00-75) M it) ani 
34 | 47,998 » 13-1 21 2 8 8 |éa N 73:6 18:2] 13:8 |75-8]79:1|106 {10:3 | 5 | S 1 | 971 
39 | 57,779 Arizona : 2-05 | 59°6| 11 105 | 10 65 | 5:4 N 66°6 20°5|15°8 | — |71-6| 102 |122] 5 | M 3 | 1196 
36 | 58,005 Ohio 12 1 83°3 | 12 12:5 |10° | 95 |4 M 90-9 18°8| 14:1 |75 |75°2| 95 | 11-4 5 M 4 | 120 
87 | 58,022 ” 148 0:8. | 107-7 | 13 12 103 | 8 55 R 90 18°8| 14:5 | 77-2] 75°2]111 |10°9 | 5 Ss 1 | 98-2 
38 | 58,049 i 3 11-7 2-7 | 69:2)19°5 | 11 10-40 | 6-5 | 79 M 79°2 | 22-5 | 16°5* | 73°3| 81°8| 1171 | 12:6 | 5 P 1 | 113-5 
” |27 | 24-25" 12-76 | 13-2 1:7 | 90-2] 11°25} 11 99 | 8 47 M 84-9 19-2| 13-5 |70:3| 738/101 }11-2 | 5/6) S 3 1109 
” 5 | 23 13 141 11 |100 |12 13 12 7-80 | 6 M 88 20-7 | 15-28 | 73-4| 767/102 |122 | 5 | M 2 | 1196 
20°75" 12 | 13-2 2-95| 85-4|10-25| 9-75| 88 | 7-6 | 45 R | 100 |/21°3}| 16:6) |} 73-2 |/BU| Law 12 F | 5) ME) 8 toe 
” 23°5 | 135 | 11-2 12 | 74 |12 5 |102 | 8 |5°5 N 85:8 184 | 138 — |782} 8° |117 | 5 | M 4 | 133 
” 2 [145 | 12°3 18 | 72-4|12°3 | 12 106 | 9 5 M 85 19°8 | 15-2 769/121 |113 | 6 M 4 | 93:3 
U.S NM. R 86-2 21-2) 16 = |} Gp) it 13-7 | 5 M 3 | 1245 
U.S. N.M. | 
44 | 227,463 Alaska }20°5 | 11-2 | 12 9 2° 6 ? 0 F , 
45 | 225,473 Apache 23°38 | 13:9 | 12:7 108 26 15 ioe 11°5 OY o7 an N 79:6 | 19°5 | 14:5 | 73°8 | 19° | 14-5 B44 | 10°35] 10°25} 5 | S) 2 |99 
46 | 228,361 ” 22 11°5 |11°9 |10-2 | 1:7 | 887|10-2 | 9:5 | 8:5*| 7:2 | 39 M 87:7] 21:0| 15-6 | —— | 21-2| 15°75) 743/823) 77 | 122 | 5 i} 2 1 | 158-4 
47 | 226,290 Arizona 23 13:8 |11-8 | 9:5 | 2:3 | 69:5/19:2 | 9:75] 84 | 71 | 6:3 N 83:3 | 2071 | 15 74:6 | 20 | 15-1 78°8| 975 |10°8 | 5 M 3 | 113-7 
5 N 68-9 | 19:4| 13-2 |68 |191)13'7 | — |73:2)10°9 | 1145) 5 S) 4 | 105 
1. One sacral segment probably from coccygeal end. 3. Las’ ar iculs i i Si 
sacral ala. 5, Four sacral segments. No ealenes of more at ba uast lam 2 arhiculates eT 21. Ossification, left ilio-sacral synchondrosis. 26. Large pelvis, heavy bones. 30. Six sac. vert., 
sitional. False promontory. 7. False promontory. 10. Congenital didlocationtoR Tia hthy Same one from coccyx(?). False promontory. 31. Six sac. vert., one from lumbar. False promontory. 
sockets, Inlet diagonals, right 12-3, left 12:3. 11. Coccyx ossified to sacrum (not included) atctorted 33. Small pelvis, coccyx ossified to sacrum. 34. Sacral angle acute (3rd vertebra) (Plate VII, 9). 
2 5 Narrow flat inlet. 37. Large pelvis. 38. Left sacral ala rises to 3°] cms. above body and 
False promontory. 44, Small 


12. False promontory, 13. Wide outlet (Plate VII, 9). 15. Well proportioned symmetrical sacrum 


(Plate VII, 8 and 10). 17. Coceyx ossified to sacrum (not included). 20. Tapering sacrum. lates with last lumbar. 42. Six sac, vert., one from lurnbar(?). 


“Male type.” 4G. ‘Male type” (Plate ILI, 3). 47. “Male type” 


Biometrika 1x 


UL 
pelvis. 


(Note: “a” means an approximate measurement often due to ies signi i 
app a b 
value or some peculiarity referred to in text.) broken, bene: Malice Oe" eran oma 
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A Study of the Variations 


in the Female Pelvis 


Table of Pelvic Measurements of 


| Inner PELYIC 
| . Inter- : an 5 
| No. cms. | & So | so | 2 Se aes = gS | as 
| | | Be | ee |S | sls = | e233 
| A /oa|s8|/4|" j488| 3 | = [ae lee 
Ll AY 
4S | 213,331 Arizona 26°1 | 22-7 |12°8 | 11 9°3 | 1°7 | 72°6 | 10°6 9°5 9°5 6°75 | 5:3 
49 | 229,363 ) 28°2 | 26°0 | 14°7 | 12°2|}10-2 | 2 69°4 | 13 13% TEAS || 3A) PT 
50 | 239,201 5) 27°56 | 25 14:3. | 12°2|10°6 | 1°6/74:1/12°6 | 12 '10°9 82 |6 
| 51 239,202 $i 26 22-5 11383 Well | SS:O0 eZine? 12°3 | 11 10°5 66 | 65 
| 52} 239,203 " 28 23°5 | 13 121) 10:2 | 19) 78°5) ea a | 10°2 8:2 | 4°85 
53 | 239,204 * 29°75) 235: | LO Osa Ora Wee 7) 12°6 | 10° | 92 8 6°2 
54 239,215 55 25 | 22°25 | 13 12°2 | 10°25 | 2 78°5 | 10-9 | 11°5 | 10°7 8 5°8 
55 239,291 6 95°25 | 22°95 | 12-25 | 12°38.) 10°85 1:5 | 88:5) 10°5° | 10s5) 952 75 | 44 
56 | 239,293 5 26:25 | 249751 139) | 124 Oe lea iS 12 115 | 10°7 coat Ets) 
57 | 239,298 06 25°S | 20°5 | 13:1 | 12°5 | 10 | 2°56 | 76°3| 10-9 | 12 | 10-9 8:3 | 3°75 
58 | 239,305 25°75 | 22 13-4 | 10:8) 9:9 | 0:95)'73°9)) 11-25 (MIESea Os 87 | 4°5 
59 | 239,309 6 27 22-70% V3) | Las2a be le Ose i 12 a ele Se 75 | 5°7 
60 | 239,318 8 23 21°25 |12°3 | 10°71) 85 | 1°6 | 69°1 | 10°25 | 11 9°72 | Side | 259 
61 239,333 a 28°56 | 24°75 )14°2 | 12:1 | 101 | 2 Cle La eye tei 11:2 76 | 54 
62 | 239,348 a 20°83 | L722 | Oe Ora Br 5s lez 79°4 | 10°25 | 9°3 8:3 6°8 | 4°6 
63 | 226,292 9) 26 | 22°5 | 13 Oe Dee LSS a 710i ra alu eectes Tat 9°8 75 | 4:8 
64 | 239,385 A 94°95 | 21-5 | 19°38 |12°3:) 9:1 | 3:2) 74 | 10:6) | dose F956 66 | 5 
65 | 239,446 oh 25°75 | 20 13°6 | 114) 9 2°4|66°2|10°5 | 10°5 | 10-4 6:2 | 5°8 
66 | 239,453 5 27:5 | 24-25 | 13:7 4) 13) 10 1°3 | 73 12°3. | 11°5 | 10°3 CU |r 
67 | 239,474 * 26 | 24 13 SNE) | oe IES tos asy) ital 9°5 8°5 67 | 5:2 
68 | 255,129 Arkansas 97°5 | 25°5 |13°5 | 138°8]11°5 | 2°38) 85:2) 12:9 | 12:5 | 106 9-1 | 3°7 
69 | 258,768 i 26°5¢ | 21-5 | 12-1 | 12°8} 11 1-8*), 91 | 11-5 98 | 8 | 55 
70 | 259,301 3 25°15 | 21°8 | 13°6 | 12°8} 11 1°8 | 80-9 | 11 12 11:4 76 | 3:9 
(fl 225,253 5) Choctaw 26°9 | 24:5 |12°6 | 18 10°7 | 2°3 | 84°6] 11-9 | 10 9 65 | 66 
72 49,735 Colorado 26°75 | 23°75 | 13°38 | 112) 8-7 | 2:5) 65:4) 11 12 ll-4 | 7 4-4 
78 | 225,214 | Comanche 26°1 | 23°4 | 13 lll} 83 | 2°8|63°8) 9-4 ®) 8°5 66 | 3°7 
ES Q | | | 
74 | 248,579 saieiay ( jo7-75/21 |1a-6 | 14:3} 11:8 | 2°5| 86-8] 13-4 | 10-25] 8-75] 8-25] 6-5 
ee | 
Peabody M.| 
to 12,804 Tennessee 25 99°3 |12°8 |13°5/ 11:5 | 2 89°8 | 11°42 | 11 10°2 76 | 4°6 
| 
U.S8.N. M. | 
76 | 227,434 Ilhnois 22 21 122 |10 | 86 | 1°4] 70-5 | 10-8 9°5 92 7 5°3 
77 | 227,440 re 24:5 | 22a 12°5 | 116) 9-4 | 2°2| 75-2) 11-8 | 10 9°35 | 7 5°9 
78 227, 441 as 2G | L725) | eos ED 7 alo e84e3q| la 10°5 8°85 | Sl | 4 
79 | 227,445 oA 24°35 |20°5 |13°3 | 14 122 2 90°2 | 12 11 10°3 8:8 | 52 
80 | 227,448 ip 25 | 21-758) 121 |13°6| 11-8 | 1:8] 97:5] 11-5 | 10 CPPS) Gul 
81 | 227,450 5 93°75 | 20°753| 12:8, | 13°38) 11275)) 166 985) 11-5) 1c 8°85} 8 5'1 
82 | 225,420 Kentucky 26°8 | 23-8 | 12-8 | 11:6] 10-1 | 15 | 78-9] 11°8 9°5 8°5 a 6°5 
83 | 225,422 3 28:3 (2527 1 ssa) | Tal 9.8) esa 7 3314) 12 eee eal: 9 8 6:9 
84 | 225 "491 Ks 24°2 |90°5 |12°6 | 11°7|10-4 | 1:3) 82-8} 11-9 |11°5 | 10°85] 8:7 | 4:5 
85 | 225,423 26-3 | 93-9 |12°6 |11:9/ 10-2 |1°7| 813/106 | 9:5 | 86 | 6-9 | 6-4 
86 | 225,425 3 26°5 | 22-52 |12°8 | 13°5 | 12 1°35 | 93°3 | 13-1 | 12 10°8 8-1 | 58 
é7 ae 9,105 Louisiana 28°5® | 258 13°3 | 11 9:2 | 1:8) 69-2) 11 10°5 9:8? | 6°72 | 6:3 
SS 55,214 33 26°5 | 24 13:1 | 13°1/11°3 | 1°8| 86-2] 10-9 | 105 | 10 72 | 5:2 
8&9 ee 5 25°75 | 22 13°8 | 12°4)106 | 1°8|76°8) 12:2 | 11°25] 10 81 | 61 
90 | 216,213 Mexico 94°5% 1 19°5 | 11°56 | 138°2/ 11°71 | 2°1/ 9671} 11-2 | 10°75 | 10°2 ok acl 
91 228,925 New Mexico | 26* 24 13°5 | 12°2|10°8 | 1°4 | 80 11 11 10°35 | 75 | 4°6 
92 | 228,950 E 264 | 21:5 | 12-4 |11°9/10 |1:9) 80-6] 12°3 }11 | 10 736 
93 | 228,967 3 rah: 24 13 12°8) 10:4 | 2°4 | 80 12 10°25) 9°'1 75 | 53 
51. (Plate IV, 4) sacrum. Transitional vertebra. 53. Six sac. vert., one from coccyx (?). 
57. Double promontory (Plate IV, 4). 59. Narrow arch. Exostoses on sacrum. 60. Four sac. vert. 


Extreme “ female type” (Plate IV, 4). 
Six sac. vert., one from coccyx (2). 


68. 


canal open from Ist sacral down. 


62. Pygmy or very small pelvis (Plate I, 1). 


ale 


Coccyx ankylosed to sacrum (not included). 
73. Rather flat pelvis (Plate IT, 2). 


67. Small pelvis. 
72. Spinal 


74, Six sac. vert., one from 


A. B. Emmons 51 


American Indian Squaws (continued). 


OUTLET Innominatre Bone SACRUM 
4H ~» q | | 
oS aq oO 3 -_ | 42 
os BD 4 Ss > an ES > aq ca Polis 
=| o 3 Peet ieaeo) Sos 3 Shas 2 | & 3 | 28 S 5 i 
#e/= | 28 | 2/85/38) 5 |88/28/2/5/ 2) 2 |ee] & lke) 4 
3 B 2 a 3 Pesta a ap geA an na ay oO | 2 'S) oe A 
G) s mR Sis | ve 
ae z ap | 
eta | | a | 
| | | 
3 25 N-M 89°6 | 18°4 | 14°2 —- | 186] 13-9 | 72°2| 71:2} 9:7 | 10°9 5 WE 
35 | 34 R 90°8 | 20°4", 15°5 20°4|15°5 | 75:9] 72°3| 10°8 | 11°5* oD | S 1 
3°75 | 2:7 R 86°5|19°3/15°5 | — |19-4]15-5 | 79:9] 705) 96 | 11-6 | 5 M | 2 
3°5 | 3°5 M 85°3 | 18°7 | 14°1 — | 191] 14:1 | 73°8| 734} 11-4 | 10°8 5/4 | S (2 
4°25 | 3-2 R 89°5 | 20°2 | 15°4 | 764) 202) 15-2 | — | 721) 9 11°5 a M 1 
3°25 | 2-4 R-M 73 L857 (MSM Is? | L8s7 378 -- 72°6 | 10-9 | 10°9 6 PS) 3 
3°25 | 2°7 R 98°1 | 18°5 | 13-4 — | 185) 13-7 | 74 74 11‘7 | 10°9 5 M 1 
3°25 | 2°6 R | 87°6 | 18°8 | 13°9 — |19°1/13°6 | 71:2) 75:6; 9°99 |10°6 | 5 M 3 
3°5 | 32 M | 89-2] 18°5 |.13°8 | 74:6} 18°5 | 13°8 St 10849199. 7) VB. 1-5 S 2 
3°5 | 2°6 R 100 19°6 | 14:2 19°8| 13°7 | 69:2] 77°6/ 10-9 | 11:3 5 M 2 
4 2 R 92°4| 192 | 14 — |19 14 73°7 | 73°38] 8 ST 5 M |; Il 
3 2°7 N O4 19°4) 14:3 | 73-7 | 19°49) 14-38 | — | 71°3/10°9 | 11°5 5 M | 1 
3°5 | 2°6 R 94°6 | 17°2 | 13 | 723i) L294) 7082 | 7-257) 10-92 4 M L | 
3°59 | 3 R | 94:9) 19°8  15°3 — |19°9)] 15:4 | 77-4 | 69°38! 10-5 | 12-4 5 M 2 
2°8 | 2°7 M | 80°9 | 16 11°3 | 70°6| 15°7 | 11°15} — | 76:9} 7:9 | 9:2 5 S 3 
35 | 2:5 M | 85:1} 18-3 | 13°52 | 73:8 | 18°3| 13°54 | — | 70-4] 9-4 | 11 5 S 3 
2°75 | 3°2 M 95°6 | 18°83 | 14°8 | 78-7 | 18°7 | 14°7 — |775| 9:7 | 10°7 5 M 2 
4 | 31 M 99 |19:4/13°9 | — | 19°5] 13-9 | 70°8|75°7| 9-4 | 121 | 5 M | 1 
4 2°8 N-M 87-7 | 18°7 | 14 — | 18°7| 14:1 | 75-4 | 68 8°5 | 11°6 5 WE eee | 
3°25 | 3°62 N 77°3 | 18°1 | 13-6" | — | 18:1] 13°6 | 75:1) 69°6} 8-2 | 10°8 5 M 1 
3 3°3 R 82°2 | 20°8 | 15°25; — | 21 15°56 | 73°8 | 76°4/}10°8 | 12°7 6 M 2 | 
3°75 | 3°5 M-R | 85:2} 19-2 | 14°6 — | 19°3| 14°68 | 75°6 | 72°38 | 9-2 | 1145) 5 1? 1 
3 3°2 M 103°6 | 20°7 | 14-4" | — | 20°7} 14:4 | 69°5 | 82°3| 10°3 | 11:7 by M 1 
3°25 | 3°5 M 75°6 | 18:2] 15°3 | 84-1] 18°1| 15-2 | — | 67-°7/10°9 | 115 | 5 S 1 
25 2 R | 103°6 | 19 14°] — |19°1| 14-1 | 73-8 | 73°6 | 10 11°2 5 M 1 
3°25 | 3-2 M | 90°4| 19°1) 14°4 | 75°8 | 19 14:1 — | 72°8 | 11°8 11°3 5 M | 2 
35, | 3:8 | N Gos 219)1o7 71-3) 27 161 | — | 789 | 109 | 114 | 6 M 2 
| 
3°7 | 2°2 R 89°5 | 18°8 | 14:1 — |19 13°8 | 72°6 | 76 10°9 | 115 6 M 2 2) 
| | 
3°75 | 2°85 2 83°3 | 17°5 | 12°9 — | 17°7 | 12°75 | 72 80°4 | 7°75 | 10°3 5 P 1 9 | 
35 | 36 N-M 79°2 | 19°2 | 14 — | 19°5 | 148 1956 | (oso) 822) | ils 6 P 3 8 
3. |2°9 | N-M-| 80°4/18-49|13-9 | — | 18°4] 13-9 | 70-1|84 | 7:8 |105*| 5 | M 2 | 1346 
3°5 q M 85°8 | 20°5 | 15-2 | 74:1 | 20°4 | 15°22 | — | 84:2] 11 11:5 | 5+? M 3 5) 
3 | 34 N 79°1 | 18°6 | 14°5 — | LSt6 | TAs i O74 8:5 | 10%35)| 5) | P 3 8 
35 | 36 N | 76°9| 20-8 | 15-4 | 74 | 20°6/15-4 | — | 871] 102 | 11-4 | 5 M l 7 
35 | 3 N Wee y2el Wales | 13-5 — | 19°4) 13-5 | 69°6 | 72°4| 10°38 | 12-7 ) M 1 3 
2-7 | 31 R | f4:4) 20% | 15°5 | 77-5 | 20 15°3 — | 70°7 | 12°5 | lee |) ays S 4 ai 
25 | 2°68 R 90°8 | 18 14:6" | — | 18°6| 14°6* | 78°5 | 76°8 | Lo°2® | 11-7 | «5 M 3 ai 
3°75 | 3 M 81 19°2 | 14°7 | 76°5 | 19°24 14-7 | 73 LO Tey 2, 5 S 1 | 9 
3°5 | 3°58 R 74°81 20 |15°9 | 79°5| 20% | 15°99) — |75:°5|10°3 |10°7 | 5 M 1 | 3 
3°754 3°2 M 89°1 | 18-1 | 14°79 | 81:2] 18°14 14°72 | — | 63-7) 9» IARI |) | 3) M 2 | 4 
3 3 M 91°7 | 19-2 | 14°5 — | 19°3| 14-54 | 75-1 | 72°8| 10°5 | 12 6 M 2 | 3 
4 3} VION 20 Melo 5s) 75°74 199) 5a == 77-6 IiiS: | 12 6) |] M 2 4 | 
3°25 | 3°4 R | 91-1 | 18°9| 14 — |19°2|13°8 | 71°8| 78°4|10°8 | 10°5 5 M 3 | 2 | 
3°25 | 31 M | 94:1 | 19-4 | 13°82 | 71°1 | 19-49) 13°8 -— | 74°6/10°9 | 11°6 5 M 3 
3-4 | 3:1 M | 91:3 / 18-9 | 13-65 | 72-2] 18-9] 13-62 | — | 72-7] 8-4 |111 | 5 M 2 
3 3 M GOB WV 1U9s3) | VS"L* |.67°8 ) 19°38) W312 | — | 7: | 9:7 | 11-1 ) M 2 ee 
- io | Ss 7 
lumbar (?) (Plate IV, 4). 75. Six sac. vert., all sacral in character. 77. (Plate V, 5), six sac. vert., 
one from coccyx. Ist lumbar forms false promontory. 78. Small pelvis. 79. Suggestive of 
Six sac. vert., one from coccyx. 82. Coccyx ossified to sacrum. 83. 5th lumbar sacralized left, 
free, right. False promontory. . 86. Round pelvic inlet (Plate LI, 2). 88. Six sac, vert., one from 
coccyx (?), 89. Six sac. vert., one from coccyx(?). 92. Coceyx ossified to sacrum. 
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50 A Study of the Variations in the Femate Pelvis RBS naaONS A 


Lable of Pelvic Measurements of , z Z 
y of American Indian Squaws (continued). 


IT | a a = = = —— = 
| ee oe OvuTLEr Isnominate Bone Sacrum 
——— = g 
| Inter- u - = = = 2 |— = 
u J ri ; | Inter- re ede] g P 3 
| No. Ju.s.n.m) — Tetanny | spines] & |e] 2S] 2] y |o88] § | 2 | 8a) 25 2 3 5 5 a 3 |e ei 
| No. | 3 2 pe] Sto | ¢ 3 Seo] = Pie ae} 3 3 ae se r] mo 2 2 re es o 4 a 
| ems.  |fe|s2)/8]/ 3 |s8e | 2 | 83) es eel eh || Hew se See |e | se Vee OF liga.) 8 
| | 2 |ze|ee/8) 8 lee] s | 3 pees | ee |e eelge|e ea/eslei2e)2|2 |28| ¢ |e a 
| a = 5 2 J a | AY | tH gz | 5 & f=} 
| Gg |SA]o0/A BAS | 3 an a5 2 5 | ‘S a ly a A is a az 33 | 
| Arizona 128 au 6°75 358) NM 42 722 M 1 
z tie 7 BS eeu R 5°D | 75-9 seal 
| » 143 16 8:2 |6 Uo, ck 2 rf 5 
13-3 22 66 | 65 R 55 79°9 5 | M 2 
Hi e 2-2 5 72. 5/4 | , 2 
v 13 ly 8-2 | 4:85 M 41 73°8 B/4 5S 2 
a Vl Be les R 54 = 5 M 1 
3 2 ae hes R-M 13°9 Ze 6 8 3 
| a 1-5 ea ll tor R 13-4 74 5 Mise i 
i 17 Bey: Ie 5 R 13-9 71-2 5 M | 3 
Fs | 2-5 33 ae 32 | M 13:8 = 5 s 2 
we 0-9 7 aes 35 | 26 R 14-2 69-2 5 MM \e3 
55 17 75 | 6-7 1 |2 | ld 73-7 5 M 1 
5 | 16 a3 | 59 ee ||| aN 143 = 5 Noe ja 
53 2 7-6 | 64 26 R | 13 745 4 M 1 
2 | ea. lace 3 R 153 174 5 M 2 
” | 68 | 46 “2 5 Ss 3 
F 18 | 75) |e 27 | M | 113 = 5 Seale: 
5 32 66 I's 25 | M | 13°58 = 5 s 3 
i 24 | 62 15:8 3:2 M 148 — 5 M 2 
» 13 | | eialtee 31 | M 13-9 70°8 5 wo} 
” 19 67 | 52 ‘8 | NM vy Tod 5 M 
Arkansas 293 91 | 3-7 7 N 13°6* 75°1 5 q 
, 18 8 | 55 R 15-25 738 6 Mm | 2 
4 18 ee laa MR 146 75-6 5 P 1 
Choctaw 3 65 | 66 M 14-40 69°5 | 5 ut 1 
Colorado i i M | 153 = 5 S 
Comanche 28 | an aie R 14:1 73°8 5 u x 
Eskimo (St x : M | 44 — 5 y 2 
Michaels) { 25 8:25 | 6-5 N ep = i M a 
78 Tennessee | 25 13-5 | 11 | 2 9 or < = 2 
Pca (ate saz Pesca [so Jee HR Rn | 895/18-3/141 | — }19 [13-8 | 726/76 | 109%} 11-5 | 6 Mm | 2 
76 Illinois 14 955 9 A 
77 » 22 0 a ee ul 58 1 17°7| 12:75 |72 | 80-4] 7-75]10°3 | 5 P 1 
78 » 13 rary S| tom. (lai N-M 19°5|14" |79°6|75°5| 82 |113 | 6 P 3 
Lo, » 2 So lena inenllias lee N-M R 18-4| 13-9" | 70-1) 84 | 7-8 | 10-5" | 5 M 2 
| » 18/975] 11:5 | 10 91 17. |61 M 5° 20-4/15-29| — |842/11 | 11d [547)  M 3 
pe » 1-6] 918/113 |105 | 885| 8 |521 N | 79: 18-6 | 14-5" | 779/744] 8:5 | 10°35] 5 | | 3 
2 Kentucky 15|789]11-3 195 | 80 | > |e. N 76" 206} 15-4 | — |871}10°2 |1l4 | 5 M 1 
83 a 113/731 | 191 | 11 Mhalies lao N 72: 19°4|13°5 | 69°6| 724] 10°3 [12-7 | 5 M 1 
34 7 11-31 82-8/21-9 |a1-5 [10-851 8-7 | 45 Roo] 7 20 |153 | — | 70-7]125 | 11-7 | 5/6 8 4 
35 » 1-7 | 81-3 | 10-6 el gtecl ecaialtata R | 90: 18°6| 14:60 | 78-3 | 76-8| 10-2" | 11-7 | 5 | M 3 
86 5 TIES ES) GO) 10" uM | 19-2147 | — |73 Jiow)ue2 | 5 | § 1 
y, Louisians Laos ee eet es, 2S R 30° [15-90 | — |75:5)103 |10-7 | 5 | M | 1 
oy Louisiana 18 | 69°2| 11 105 | 9:84] 6°74 | 6:3 | 2 ote lice : 2 M 3 
a6 ee 1:8\|i6°3 hoo bowel ero ee os M | 1814 14-78 | — 087 po i ye M 2 
8! 8 2 “9 +50 | 75:1 | 72" 5 | 12 2 
ee Mexivo eal ee deal Pees | ee van toe eae Weal rae ies | M | 2 
ce New Mexico Wei eyelash al ieeseeal| ees. ll oes R 19-2| 13:8 | 71-8) 784/108 | 105 | 5 M | 3 
be 5 19|80-6]19:3 |11 |10 | 7-3 1/6 M 19-44) 138 | -— | 746/109 |11°6 | 5 M | 3 
oS » 2480 lag) \r0-2eil torn levee tetas M 18:9] 13-68 | — | 727) 84 |lbL | 5 Me | 2 
A clea to: | M 19-34 1318 | — | 715} 9-7 | ud | 5 M | 2 
51, (Plate IV, 4) sacrum. Transitional vertebra 52 i 3 ir a , hi 
oer , cru. — ‘Trans bra. 3. actvert hy t oF A mor 
57. Double promontory (Plate IV, 4). 59. Narrow arch aan: ae Vers., done: froma cocoyay) lumbar (?) (Plate LV, 4). 75. Six sac. vert., all sacral in character. 77. (Plate V, 5), six sac. vert... 
Extreme “female type” (Plate LV, 4). 62. Pygmy or very small pelvis (Plate 1,1 on Hout sac, vert, one from coceyx. 1st lumbar forms false promontory. 78. Small pelvis. 79. Suggestive of 
68. Six sac. vert., one from coccyx (). 71. Coccyx ankylosed to sacrum (not aa ded), mall pelvis: Six sac. yert., one from coccyx. 82. Coceyx ossified to sacrum. 83. 5th lumbar sacralized left, 
canal open from 1st sacral down. 73. Rather flat pelvi a ne ). 72. Spina free, right. False promontory. + 86. Round pelvic inlet (Plate IT, 2). 88. Six sac. vert., one from 
ut pelvis (Plate II, 2) 74. 8 4 3B} Pp y: p +p 
9“) . Six sac. vert., one from coceyx (1), 89. Six sac. vert., one from coccyx(?). 92. Coceyx ossified to sacrum, 


7-2 


segments (Plate IV, 4). 111. Coccyx ossified to sacrum (not included), 116. Wide outlet (bones dark 


52 A Study of the Variations in the Female Pelvis | 
Table of Pelvic Measwrements of | 
INLET Pernyic zg 
U.S.N.M.| Trib eects tater ile i 3 z & 
eo) We Ne av. ribe or erests nter- | @ o we Far a 1 
No. No. | Locality | spines | 5 s B | 2s I r 688 & 2 28 s 5 : 
ome: | 3 |e8\/e2)| € | 8 (838) See oe 
| e |22\22|8 | 2 |ae2] a | 6 | se lee\a 
| Pen (og |e SBA) 5 Bs | 1 ea 
| a i 
D4 246,968 | New Mexico | 27°5 | 23°8 | 14:4 | 13°3 | 10°8 2°5 75 12°6 | 11 10 8 5°75) 
95 225,215 Pah Ute 25°S | 22°2 | 12°7 12°5 | 10°5 2 82°7 | 13°5 | 14 ier |) ale 7h | PAS} 
96 204,254 Peru 27:3 |21°5 | 13°8 | 12°2 9°25 | 2°9 67 11°3° | 10°25 9 75 | 53 
97 225,213 3 | 26°5 | 22°25) 12°5 | 10-4 fo}? 2-4 64 11:9 | 10 8°6 81 | 5% 
98 225,250 nf 25 | 20°5 | 12°9 | 12 9°5 2°5 73°6|12°5 | 11 10°3 7°72 |6 
99 227,006 5 | 24°6 | 20 12 12°3 | 10°8 15 90 11l‘1 | 10 9°6 63 | 5°7 
100 227 .OU7 5) | 27 24°56 1135 | 14 L1°6 2-4 86 11°75 | 12 11°75 | 7:4 |5°7 
101 227,011 e | 25°3 | 23°3 | 12°3 | 121 | 10° 16 85°4 | 12°52 | 11 9°58 | 8°42 | 4-18 
102 227,014 5 | 26 22 138°2 | 12°8-/ 11°25) 1°6 85:2 | 11°3* | 112 10 Te 55 
103 | 227,024 . 97°3 |26°5 | 14:3 | 131 | 11-35] 1:8 | 79:4 13-6 | 19-b° | il-5 | 8-8 "eq 
104 227,432 " | 24:5 | 22-3 | 19 11-4 92 2-4 75 11°5 9°5 81 7 56 
105 242,513 7 | 25 21°5 | 11°5 11-2 9-1 2°1 7971 | 11°5* | 11°5? | 11 7:2? | 5:3 
106 | 242,568 %e 24-5 |92° | 0-8 \ 11-6) 9-4) 9-3. || 9-7) aici enemG 8:3 | 63 | 64 
107 225,217 Sioux | 27-1 23 13:1 13°2 | 11°6 1°6 88:5 | 11-4 | 12 10°7 85 | 3°9 
108 225,218 = 28°5 | 25 13°9 | 13°3 | 1071 3-2 (2-7 | We Sk ae O7 8 51 
109 225,221 27°8 | 24°1 | 12-1 13°3 | 111 2°2 91-7 | 11 ll 9°8 029) eae 
110 | 225,242 ie 98 |25°5 |12°6 | 13-2 | 11-2 | 2 88:8 | 12-4 |11°5 | 97 | 8:4 | 5-1 
111 | 225,261 | is oF 1235: 15 123: | 10° | 9-39) 76-98) 10S am 9:5 | 7-28 | 4:1 
112 225,415 “ | 29°1 | 25 141 14°4 | 12 | 2:4 85:1 | 13°2 | 11:25 | 10°6 76 | 6°5 
113 225,416 *s | 28 241 | 12°75 | 11°8°| 10°15) 1-7 79°6 | 12°5 | 10°5 94 7°75 | 6°4 
114 169,672 Virginia | 28* 24°58 | 13°22 WS O22 4 eG tees || ae |p ait 9:9 72 | 45 
115 98,470 Wisconsin | 25°5 | 22°6 | 13°25 ? 10°85 2 81:9} 10°8 | 10°3 9°25) 7:15 | 5°3 
A.M.N.H., 
Neve 
116 H, 3658 | Pueblo, N.M.| 27 23°25 | 13 13°3 | il: 2°1 86°2|12°7 | 12°56 | 11 9:2 |5 
ny - e 245 [218 [125 | 12-2 | 11 12 | 88 |10°6 |10%5 |102 | 65 | 5:5 
118 55 . | 25°2 | 23 13-4 | 12°5 | 10°7 18 79:9 | 21s8) | Laos alia 76 | 53 
119 if a 24-9 | 232 |13-71 | 11-4 | 9:5 | 1:9 | 795/12 |10 9 74 [5-9 
120 5 | 27°3 | 25°4 | 13°3 | 12°3 | 10-4 19 77°4| 12 10°5 8°5 75 | 5:2 
121 H, 102 South Utah § 23°8 |19°3  12°4 | 11 9 2 72:6 |11°6 ({12°5 | 11 94 | 4 
122 H, 191 - 22 20 | 12 | 11-8 10:4 14 86°7 | 11°38 | 11 10°4 77 | 4°8 
123 H, 271 ¥ | 26°5 | 23 | 13°1 13 SLO ie TA | LESH LO) 9°6 72 |5°8 
124 | H, 307 iy | 27 93'°4 |14 | 12°1 | 106 | 15 | 75-7) 10°6 | 10-5) ie Ocyen eeeannon 
| 125 H, 321 : [2527 22:9) Mansy si Ose 9 1-2 68°7 | 10°3 | 10° 97 6:8) | 4:7 
| 126 99/2393 Mexico Q24°5 | 22 | 12 11°6 9°9 1°7 82°5 | 11°2 | 10°5 9°5 T1 (57 
127 99/2230 | _ 25 20° | 12°9 | 13°3 | 11°8 15 91°5 | 114 | 10 9 7 5°5 
128 99/2165 5 26 23°7 | 12°5 115 | 10 15 80 11 10°5 8-9 75 [5 
| 129 | 99/305 - 25°6 | 22 13:2) | 1s26 | 1956 64 727 Ors lO D2) 16x68 ase 
| 130 99/2506 on | 27 | 24-1 | 121 13‘l | 115 1°6 95 11°8 9°5 82 73 \6 
| 131 99/2426 ss | 26°9 | 24°6 | 13:1 12-1 | 10°5 16 80°1 | 10°2 9°5 9°2 61 | 5:3 
| 132 | 99/2542 As 25°6 | 22 12°8 13°2 | 11-9 13 | 93 11°2 | 10 9-2 67 | 54 
153 99/2330 4 | 25°1 | 23°5 | 13 14°9 | 13 19 | 100 12°6 | 11 9°5 85 | 5:3 
13. 99/2325 - | 251 | 214 | 13 12°8 | 11:3 15 | 86:°9/11°6 | 11 10 Catteni 
135 | 99/2539 . 98 |24 |13°5 | 12°8 | 11 18 | 81°5|12°7 | 11 91 | 81 | 5:8 
136 99/2329 a 25°1 | 23-1 | 13-1 13-7 | 11°6 21 88°5 | 12°5 | 11 9-4 8:3 | 4:5 
3a 99/2544 op 26°1 | 21 12°56 | 12°9 | 11 19 88 10:3 | 10°5 9:7 8 4°4 
138 | 99/2486 Fs | 23°2 | 22 T2265 LIES SO 1°9 79°4| 11:5 | 10 9 7 4°6 
94, Six sac. vert. (Plate [V, 4). False promontory. Spinal canal of sacrum open. 95. Shallow 
pelvis, wide outlet (Plate III, 3). 98. Six sac. vert., one from coccyx(?). 99. Six sac. vert., all 
sacral in character (Plate IV, 4). 100, False promontory (Plate VI, 6). 102. Six sac. vert., one from 
coccyx (?), 104. Six sac. vert., one from coccyx (2). 105. Last sacral segment missing. 
106. Transitional vertebra. No false promontory. Small pelvic cavity (Plate IV, 4, Plate VI, 7). 
107. Large pelvis (Plate I, 1). 108. Large pelvis. 109. Transitional vertebra, left =6, right =5 
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American Indian Squaws (continued). 


OurLer InnominatE BonE c Sackum 
oe le 
Ea | . d A d 3 
mere) 2 | 2 zl 4. Pee eee tae. we. 2 
ee\e| 22 | 3 [25/25/35 (83/28/38) 3)/2/ 3 a8] 6 [88] 2 
ea | 2 Oe | SS gs | 38 2 4 & 2 5 5 go'&> | 
eee | BP" a |" ele me) a leg) jae)” 
a | o 
| | | 
3°25 | 3 M 79°4 | 20°8 | 15°35 | 73°8 | 20°7 | 15°25 | — | 72 11:9 |) <9 6 P 2 100 
Sia|ol | R 82°2 | 19°8 | 14°8 19°9 | 14%) | 74°9/)77°6| 7°5 | 11°6 5 | Ie 1 154°7 
4951315) N Ore 20e7 || Loe20 I —= 20%" bed: | 7359 WEB 10-7 || 12-4) 5 | M 3 | 115-9 
Bo 24 ) N 72°3 | 18:4] 14°5 | 73-4} 18°3 | 14-2 — |69°4) 885/107 | 5 M 2 | 120°9 
3 22 | M 82-4 | 19°8 | 15 20 15‘L | 75°5 | 80 O52) G9 6 S 2 113°3. 
3 33 | M 86°5 | 18°7 | 14°4 | 77 18°4 | 14-4 a rhs 9-7 | 10°5 6 | M | 1 | 1082 
325/32 | M 100 20°1 | 14°5 — | 20-2 | 14°8* | 73°3| 74°8| 10-6 | 11°7 i) | M 3 | 110°4 
37259) 3 M 76 18-2 | 14 76°9 | 18°2 | 14 GO=2s | 92 11°5 5 M 1 127°3 
3°25 | 2°8 M 88°5 | 18°7 | 14°6 — 187) l4-7 | 78:6))"71-9'| 9-4e | 11-5 G6 S Me) ita keyy 
3°25 | 3°5 R 87°4 | 20°2 | 15°6 — |20°4)15°1 | 74 | 74°7 | 10 117 5 M Ly) se 
31 |4 N 70°4 | 18°6 | 13°75 | 73-9 | 18°3 | 13°75 | — | 75:9, 9 11°2 6 M 2 | 124°4 
3°25 | 3 R 95°7 | 17°4 | 13°25 | — | 17°44 1386 | 78:1|69°6) 8-74 | 10 5 M 1 114°9 
2°75 | 3 M 74°8 | 18 141 | 78°3 | 18 14:1 — |73°5] 9:8 |10°5 | 5/6 M 2 | 107°7 
3°75 | 371 R 93°9 | 20°4 | 16-1 — |20°5)16°1 | 78°5| 75°6| 9:9 | 13:2 5 M 1 133°3 
4°75 | 34 N 82°3 | 21°3 | 16°6 SS Oi ase8. ©) V3.8. 751 |) 9g. | Wiles 5) P Me eee rl 
3°75 | 33 N 89°1 | 20°1 | 15°6 | 77-6 | 21-1 | 15°5 — | 75°9|10°8 | 11°5 | 5/6 M 3 | 106°6 
4 3°2 N 78°2 | 20°2 | 15°5 ==. 9055.) 153. | 74°6 | 73-2) 10:5 | 12°1 i) Ss 1 1152 
3°25 | 2-9 M 87°9 | 20 15°1 | 75:5 | 19°9 | lol —— | 741} 9°83 | 11°5 5 M 1 117°3 
3 37 N 80°3 | 21°5 | 16°9 | 78°6 | 21°5 | 16°8 — | 73°8} 12 W215] 5 M 2 | 101°2 
34 |38 | N 75°2 | 20 16 80 19-4} 15°35} — | 71:4) 81 | 12 5) P 1 148°1 
2°75 | 3 Mi 89°2 | 18°8 | 15 — | 19-1 | 15" 78°5 | 68:2} 9:9 | 11°9 i) M 3 | 12072 
2°38. |325| N 85°6 | 20 14°7 | 73-5 | 19-7 | 14°4 —— | 784/105 | 11: 5 M 2 | 112°4 
| 
4:3 | 3°3 M 86°6 | 20-1 | 15°4 — | 2071/15 | 77:1 | 74:4] 1071 | 11-4 5 8 3 | 112°9 
37 |25 | M 96°2 | 18°75 | 14-1 — |18°5)14°3 |77°3) 75:5) 9-7 | 11°5 5 | M OR) Malye: 
Bie | 2 R 94°1 | 18°8 | 15-4 — |18°8)15°6 |77°7| 74:6) 10-4 | 11°5 5) M 3 | 110°6 
3°8 | 1:9 M 75 18°9 | 14°28 | — | 19 14°2* | 74°7 | 76°3 | 108 11°9 5 Me as 
3 16 M 70°8 | 18°2 | 14°7 — |19°1] 15-2 | 79°6 | 70 Wilesy = | Wey, 5) S 2 | 103°5 
4 2°6 R 94°8 | 18°3|13°3 | 72°7 | 18:2 | 13°32" | — | 76:9} 9-2 | 11 5 M 3 | 119°6 
4 12 M 92 19°6 | 14°2 — | 19°8 | Nasa) | Fla |.90 8:2 | 10°4 4 | M 1 | 126°8 
35 | 2°] R 84°8 | 19°2| 14°8 | 77-1) 19:2) 14°8 | — | 72:5] 11:3 | 11°3 5 | Reverse | 5(?) | 100 
A] | 2°2 M 83°6 | 20 14°3 — |20°3|14°6 | 71:9] 75-2 | 10°8 | 1271 6 | M | 2 | 112 
Bro | 2:9 M 94°1 | 18:3] 13°8 | 75:4 | 18°1 | 13°8 S28 | LOR7 6 | S By | why 
36 | Ll R 84°8 | 18 13°7 | 76°1 | 18 13°7 — |73:5) 9 | LO 5 M 1 114°3 
4 27 N 79 19°4 | 14°9 = | 1987 NaF 74-6) 78:8) TA | tS 5 M 3 | 103°6 
38 | 2:9 M 80°9 | 19 14*1 | 74:2 | 18-9 | 14°3 =) Foy Ser ah 5 | S 3 | 1134 
3 2°5 M 97°1 | 18°9 | 13°6 — |19:1]18°4 | 7071 | 74:9} 10-1 | 10°8 5) M 1 | 106°9 
4 2°4 vV 69°5 |} 19°6 | 144 | 73°5 | 19-4 | 14°3 72°6 10 12°1 5 M ire pple 
3° | 2'1 N 90°2 | 19 15 78°9 | 19 14°8 — |706] 91 |11°3 5 Je il 124°2 
27 ! M 821] 19°3 | 14°3 | 74:1} 19 14°3 75°4)11°2 | 117 | 5/6 M 2 | 104°5 
3°2 | 2°5 M 75°4 | 20°6 | 15 72°6 | 20-4 | 15 == | er il | 10°1 | 12°2 5 M 2 | 120°8 
4°77 | 2°38 M 86°2 | 18°8 | 14-7 | 78-2 | 18-8 | 14°5 — |74:9) 9°6 | 11°5 5) M LT) 119%8 
3°7 | 2°5 M 86°6 | 19°9 | 15°5 — |20°4)15°3 | 75 72°9 | 10°6 | 12°5 5 ) 4 /117°9 
3°8 | 2°5 M 75°2 | 20 14 — | 2071) 14:1 | 7071 | 8071| 10-2 | 12°5 os) M 2 | 122°5 
3 2°2 M 94°1/ 19°9 | 14°5 | 72°9 | 19-9 | 14°5 — | 76:2) 10 12°1 5 P 1 aan 
25 | 3°3 M 78°2 | 18°9) 14°1 | 79°9 | 18°8 | 14°3 — | 8l-4} 104 |} 1074) 5 M | 1 | 102°9 
| | 
colour, identification). 117. Bones light colour, identification. 118. Sacral canal open 
lst and 2nd sac. vert. behind. 119. Exostoses, Ist sacral vert. left. 120. 3rd, 4th and 5th sacral 
segments broken on right. 122, Four sac. vert., sacrum complete. 123. False promontory. 
Shght reverse curve of sacrum. 124, Six sac. vert., all sacral, double promontory. 125. “Six sac: 
vert., all sacral, false promontory. 127. Double promontory, ‘ Male type” of pelvis. 130, Small 


outlet. 132. Transitional vertebra, right sacralized, left free, false promontory. 
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American Indian Squaws (continued). 


Lable of Pelvic Measwrements of 


INLET Penvic OvrLer IynominatEe Bone x Sacrum 
Int anal inal a z 
| Inter- Fa {) lee # 4 
U.S.N.M.| Tribe or Inter-| 2 2 2 Altes g 2 éonet || cs! 2 x > . e = 2 se 
No. No. Locality Hie spines) 5 $ 8 | 22 s Pa 8s s 3 83 | 22 | ce a ea 3 as se 4 |Se Ze Ly 2 a a 3 q £ 8 a 5 
cme |e | se lei | 2 3 |35¢| 4 g S= | 3: | Be 5 ag = |Sm| eme| os |\ee| Se} s | s = 3 |8e8 5 |88/ 3 
a |e2\|22| 8 | 2 |3e2| 3 | § |S \23 [Se| 2] 32 | 2 |ea| 2/2 |S4)e8/B)e | s | s 28) & | 88] = 
# los |e | 3 & |43e| 3 S | Sa |28) Sa! 5 FI me Wei a x a qi a |43 as 
| a FY Wey | 3) ei 5 é Hey | is} 
94 New Mexico | 2°5 fi i 10 8 5°75 | 6 P 2 100 as 
95 Pah Ute | 2 Be yy 1a ir [98 5 Pp 1 | 154-7 
96 Peru | 2-9 67 10°25) 9 75 15:3 5 M 3 115°9 
97 = 2-4 5 10 86 | 81 155 5 M 2 | 120°9 
98 _ 2-5 11 103 | 77 |6 | | 2 8 2 | 1133 
99 » 15 10 a6 63 |57 6 M 1 | 108-2 
100 A 24 12 11:75] 7-4 15:7 5 | M 3 | 110-4 
101 . | 16 11° 9°58 | 8-4" | Ja 5 M 1 | 127°8 
102 ” | 16 11" | 10 7 | 5:5 k 6 8 Lj i7 
| 103 mn 1:8 | 79: 12°5 | 115 | 8:8 | 61 ‘ 5 M L 
| 104 5 Q-4 | 75 975) 8-0 | 36 R 6 | M 2 
105 » 2:1 | 79-1 58} 11 | 7-281 5:3 B: Baily atl aa) 
106 ” 2-2 | 79:7 9 8:3 | 63 | 6-4 : x i 5/6); M 2 | 
107 Sioux 16 | 88:5 12 10:7 | 8:5 | 3:9 3°9 | 20- F 20° 5° 6) 99 5 M 1 
108 ” 32 | 72:7 11 97 | 8 a1 N 82°3 | 21:3 | 16°6 21-4| 15-8 | 73°8| 75-1, 96 5 P i 
109 os 22 | 91-7 ia 98 | 76 | 4-7 N 89:1 | 20-1 | 15-6 21:1|15°5 | — | 75:9/ 10:8 |11°5 | 5/6 M 3 
110 = 2 88'8 11°5 O7 8° Bl N 78°2 | 20:2 | 155 20°5 | 153) | 74°6 | 73-2) 10° 5 Ss 1 
dil 5) 23 | 76:9 11 9:5 | 7-28) 4-1 | M 87:9 | 20 | 15-1 19:9] 15:1 | — | 74-1] 9:8 6 M 1 
112 m 2-4 | 85-1 11:25}10-6 | 7 6°5 N 80°3 | 21:5 | 16-9 21:5) 16-8 | — |73:8)12 | 5 M 2 
us Se 2E 7 79-6 | 1 10°5 9-4 7°75 | 6:4 N 73°2 | 20 16 19-4] 15°85 | — | 71:4) 81 5 18 1 
114 Virginia | 28" 16 | 77-3 11 99 | 7 45 M 89-2 | 18-8] 15 19°71] 15" | 78-5 | 68-2] 9:9 5 M 3 
115 98,470 | Wisconsin | 257 1 81:9 10°3 9:25| 7: 5:3 N 85°6 | 20) | 147 19°7 | 14-4 — | 78-4) 105 5 M 2 
A.M.N.H., | 
N.Y. 
116 | H, 3658 Pueblo, N.M. | 13°3 | 112 | 21 | 86-2)12°7 |12°5 }11 9:2 |5 M 86-6 | 20-71] 15-4 | — | 201]15°5 | 77:1] 74-4]101 | 11-4 | 5 S 3 
ii ” 5 12:20) 01 12 | 88 [10 |10° |10:2 | 6:5 | 5:5 M 96-2 | 18°53 | 14-1 | — | 18-5 | 14:3 | 77:3] 75:5] 9-7 | 11-5 5 M 2 | 
” 125 | 10°7 18 79°9|11°8 | 125 | 111 76 |53 | R 9471 | 18°83 | 15-4 — |188|156 | 77:7 | 74:6} 10-4 | 11-5 5 M 3 110°6 
” | 11-4 95 19 72°5 | 12 10 9 74 | 5:9 M 75 18:9} 14:2" | — | 19 14:2" | 74°7 | 76:3 | 10" 11:9 5 AML fy i 119 
” | 12°3 | 10-4 19 77-4) 12 10°5 8:5 75 | 5-2 M 70°8 | 18-2 | 14:7 — | 19:1] 15-2 | 79°6| 70 11°3 | 11:7 5 5S | 2 | 103°5 
South Utah | 11 9 2 72:6|11-6 (12:5 | 11 ga |4 R 94:8} 18-3] 13-3 | 72:7 | 18:2| 13:3" | — | 76-9] 9-2 | 11 5 M 3 | 119°6 
a | 11°8 | 10-4 | 1-4 | 86-7}11:3 | 11 10-4 | 77 |4:8 M 92 |19°6]} 14:2 | — | 19-8] 14-2 | 71-7] 90 8:2 |104 | 4 M 1 | 126:8 
. } 11-8 |10°1 | 1-7 | 77:2}11:8 {10 96 | 7-2 158 R 84:8} 19:2| 14°8 | 77-1 | 19-2] 14°8 | — | 72°5]11:3 | 11:3 | 5 | Reverse | 5(?) | 100 
es 121/10 | 15 | 75°7|11-6 105 | 9-7 | 71 |62 M 83°6]20 |14:3 | — | 20-3] 146 |71-9]75-2/10:8 |121 | 6 M 2° | 112 
| FD: | 10-2 | 9 1:2 | 68:7]10°3 |10° | 9-7 | 68 | 4:7 M 94:1 |18°3] 13-8 | 75-4] 18:1|13°8 | — |71-2]11-8 |10-7 | 6 s 5 | 90:7 
| Mexico IL-6 | 99° | 1-7 | 82°5/11-2 110° | 95 | 7-1 | 5:7 R 848/18 | 13-7 |761)18 |13:7 | — |1735] 91 | 10-4 | 5 M 1 | 114:3 
| | ” 13°3 | 11:8 | 1-5 915) 114 | 10 9 7 55 N 79 19-4 | 14-9 -— |19-7 | 14°7 | 746] 78:8) 111 }115 | 5 M 3 | 1036 
» 115 | 10 15 80 11 H 8: 7D IAD, M 80°9 | 19 14:1 | 74:2] 18:9} 14°3 Se | ePIa| Gee | 5 5 3 | 113-4 
0 11°2 | 9:6 | 1:6 | 72-7} 10°3 9: 6G | 4-6 M 97:1 | 18-9] 13-6 | — | 19-1] 13-4 | 70-1} 74:9] 10:1 |10°8 | 5 M 1 | 106-9 
rp 131} 115 | 1:6 | 95 | 11°8 8 rs |6 | v 69:5 | 19:6] 14-4 | 73-5|19-4/14:3 | — |72-6)10 |121 | 5 M 1 | 121 
ss 26°9 12:1 |105 | 16 | 80-1 | 10-2 9 G1 | 5-3 | N go2}19 |15 |789]19 |148 | — |} 70:6! 91 |113 | 5 P 1 | 1242 
n 25°6 132/119 | 13 | 93 | 11-2 | 92 | 67 [54 M 82:1] 19:3] 14:3 |74-1]19 | 143 | — | 75-4) 11-2 | 11-7 | 5/6 M 2 | 104-5 
9) 25°1 14-9 | 13 19 | 100 | 12-6 | 9: 85 | 53 M 754] 206/15 | 72°6 | 20-4] 15 — |82-1}1071 |12°2 | 5 M 2 | 120°8 
| ” 2571 12°8 | 11-3 | 1:5 | 86-9] 11-6 10 77 | 52 M 86-2] 18:8] 14-7 |78:2]18:8|145 | — | 74:9) 96 |11:5 | 5 M 1 | 1198 
| ” 28 12°8 | 11 18 81°85 | 12°7 ‘921i 81 | 5:8 | M 86°6  19°9 | 15° — |20-4]16°3 | 75 72°9|10°6 | 12°5 5 5 4/1179 
” 251 13-7 | 116 2:1 88°5 | 12-5 9-4 8:3 | 4:5 M 75°2 | 20 14 — | 2071] 14:1 | 7071] 80:1| 10-2 | 12°5 5 M 2 | 122°5 
138 | 99/2486 - 26g. 12°9 | 11 19 | 88 | 10:3 97 | 8 | 44 M 94:1] 19-9] 14-5 |72°9]19-9]145 | — |762/10 |121 | 5 ip 1 | 121 
Ep ” 23°2 11-9 | 10 19 | 79-4] 11-5 9 7 46 M 782/189] 14:1 | 79:9] 18'8| 14:3 | — | 81:4) 10-4 | 10-7") 5 M 1 | 102-9 
94. Six sac. vert. (Plate IV, 4). False promontory. Spinal canal of sacrum open m Of colour, identification 117. Bones light colour, identification. 118, Sacral canal open 
Pelvis wide outlet (Plate III, Sh 98. Six sac. vert., at from coccyx (2). a he 1o8: Shollew Ist and 2nd sac. ee behind. 119. aaeee, Ist sacral vert. left. 120. 3rd, 4th and 5th el 
poets ng naracter (Plate IV, 4). 100. False promontory (Plate VI, 6). 102) Six sac\vert=oueltrom segments broken on right. 122. Four sac. vert., sacrum complete. 123. False promontory. 
ae “Tar sana 104. oe Sac. vert., one from coccyx (?), 105. Last sacral segment missing. Shght reverse curve of sacrum. 124, Six sac. yert., all sacral, double promontory. 125. Six sae. 
Toe r ml ional vertebra. No false promontory, Small pelvic cavity (Plate IV, 4, Plate VI, 7). vert., all sacral, false promontory. 127. Double promontory, “ Male type” of pelvis. 130, Small 
- Large pelvis (Plate I, 1). 108. Large pelvis. 109. ‘Transitional vertebra, left=6, right=5 outlet. 132. ‘Transitional vertebra, right sacralized, left tree, false promontory. 


segments (Plate IV, 4). 111. Coccyx ossified to sacrum (not included). 116. Wide outlet (bones dark 
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Table of Pelvic Measurements of 


a a Oe a 


INLET PeLvic 
A. M.N.H., Myibevor! ec | iatees ae ier: Bee = E 
No. | N. Y., No. Locality | ‘spines| & 32/8 S 5 Peles) Seer. | S5 
cms, PS |)ss/se|s | 3 |§e3| a | 2 | eS les 
¢ |\aa122\ 2] 8 joe) au er 
& |SA|SS| 8 | 986) selene 
se —_ Ay 
139 | H,15075 | Apache (Arizona) | 27 21:4 | 18°3 | 14°73; 11°5 | 2°8 | 86:5 | 12°8 | 129°5| 10°} 8:9 | 5-2 
140 | H, 15098 Pueblo 1/265 | 23:5" |14 | 113] 9:5 |1:8|67°8) 12 |115]10 | 85 | 5:5 
141 H, 15124. Arizona 27 23°7 | ISA | WT Sea lt) OsS aa 10:25 5 
142 | H, 15094 - 24 21 12:1 | 12-4] 10°5 | 1:9 | 86°8| 11-9 | 11 9:9| 7:9 | 4:6 
143 | H, 15154 * 25°5 | 22 14 TOSS SOLQ IGG: kGoe7 slo Le || WOe2)) 73" a25s8 
144 H, 15124 rs 25 22°2 | 13 10°6:) ~9:2)) 1:4) 7037 | 11-4) 10 10 66 | 6:3 
145 | H, 16035 S.E. Utah 24 | 19 11°6 | 11:8] 10-9 | 0-9 | 93-9 | 10 9 8:4} 66 | 45 
146 99/106 Eskimo Q4a"He | QS \k12-1 | Oran 7c5ue 9) Olson Pe Ors aa 8:8} 8 4°] 
147 99/3743 Brit. Columbia | 28 | 24°12 | 74 13°6. | 19°3 | 1°3.| 87-9) T8245 Da Os Gas e725) 
148 99/3737 a5 95°4 | 23 13:57) Vay |e Sey Uist |eoi sae eT 10°55] 7:5 | 5 
149 | 99/1614 fee see 24 |205 |11-7| 12:3] 10-8] 1°5| 92°3| 10°6 | 105/10 | 71 |5 
50 99/3756 Brit. Columbia | 27°6 | 24-4 | 13 12°4 | 10°5 | 1:9 | 80°8 | 13 15) |) LOM SLOG. 
51 | 99/1720 | Fort Rupert, B.C.| 23°7 |19°5 |13 |11:3] 9 | 23/869) 12 |12 | 10:3] 8-2 | 5-2 
52 99/1731 oF 25°8 | 24 13°8 | 11°8} 10°5 | 1°3 | 76-1 | 10°8 | 10:5] 10:3} 6:3 | 6 
53 99/1727 64) 24 22 13-1 | 13°4) 11 2:-4)84 | 12°38 | 12 99) 9 5 
5A 99/1741 of 24°8 | 22 13°4 | 11°8.| 10°] | 177 | 7574) L255) | TES Osa eons coro 
55 99/1670 Nimpkish, B.C. | 25°5 | 23 13 14°3 | 12°9| 1°4 | 99°2} 10°9 | 11 LO 7 AT 
56 | 99/1674 . 24°5 |22°5 |13-4/13:1)12 |11]/869/13 |115]103) 8 16 
57 | 99/1676 i 26% | 24-99 114 | 194/11 | 14/786) 142 | 13 | 11-2] 95 | 6-1 
Usain. VE 
158 225,407 Sioux 25°8. | 22 11°8 | 12°4] 10°8 | 1°6 | 91°5 | 11°5*) 95a 8:58] 6°52 | 5-3 
159 225,412 * 26 22 13:3 | 17) 9:5 | 2°2| 71:4) 10-4 | 10 9:2) 6°75 | 46 
160 225,414 3 26°5 | 24 13°5:} 13:1) | 11:8 | 1:3) 87-4) L153: |) NOe5a| Oeste GoM ras 
161 | 225,408 Dakota 26°8 | 22 sie |lessey|) UL ARV isos ar. |! T1 1071} 78 | 4:4 
162 225,409 As 25°33) | 9S | WS | L257 | NOs 222801) LIT eae As Ss Sia ese) 
163 225,406 Cheyenne a) || Il 13 12°9| 11-3 | 1°6 | 86°9 | 12°1 | 11 10°3| 7:4 | 671 
104 261,810 Arkansas 2th 25 L4eO SOL Sa 9) 1°8 | 64:1 | 11°8 9 8:4|/ 6:5 | 5:8 
165 | 261,824 ¥ 98:2 | 23°59 114:5112:7/11 |1°7| 75:9) 13-2 |11 |105| 75 17 
166 wlaG Argentina 26 22 Ib PAST TE ibssay | 102) 1°35 | 9328) T2575 VOc5s Osa tara 6°6 
167 262,577 Arkansas 26°75 | 23 13°5 | 12 10 2 (4-1) 11-8 | Tat 97) TE | 4:7 
168 262,570 - 27 Zot \ LsOn LA ef aL 1‘7 | 88 11°3 | 10°5| 9 75 | 4:5 
169 264,488 Peru 26°71 | 23-5 | 18 13°5 | 11°5 | 2 88'4 | 10°6 | 10°5| 102] 68 | 4°8 
L170 264,489 - 26°6 | 23°9 | 12°5 | 12°3 | 10°3 | 2 82'4 | 12°3 | 11°5|10°8|) 7:3 | 6-6 
171 #18 Arizona 25 23 11°9} 10:9} 9-1} 1:8] 76:4} 10°8 9 8:8; 63 | 56 
L72 7h 0) 5 Zone 9 | LOS LIES elO Sale 1°8 | 70°3 | 11 10 8:3) 75 | 4:3 
173 *83 % 25 pyady lege} || AE eIay i) 21/762] 116 | 105] 9:4) 7:3 | 5:3 
174 | *161 ie 244 |1971 [13 | 10-4] 8-8} 21/63'8/10 |11 |10 | 6 147 
L75 59) 5 27 | 245 | 12-85) 11-2) Oe 21 | 7a | aca OL oa aes 85 | 5-2 
176 ne . 24°5 | 22 12°59) 11:36 95.18) 76 TO S295 9-4) 7°52 | 4°6 
LO *104. 5 26 23°5 | 12°83] 11:4] 972) 2:2) 71:9) 11-4 975 | 86] 7:3? | 5:2 
17S 112 om 2535, 1) 223) 12-9e WF Seon Oie Geta lls OLDE Eo 75 | 5 
179 *82 ie 95°3 | 23°5 | 13°5 | 11°8 | 9:7 | 2:1} 71:9 | 11:8" | DOR oer leee 2am noe 
1LSO 7) ns 25 22°5 | 12°79) 115) 9 2°31 697 | LOU OLS e Oia ed 4°] 
1IS1 *7] ; 23°88 | 20°9 | 12 11:4| 9:1 | 2°3| 75:8 | 103 | 11 i) | ei) ass) 
182 45) e 25 23 TASTE | ARTES I Ely 9g) 2°1| 74:4] 10°8 | 11 97| 83 | 44 
183 | *Cave 3 fe 97°8 | 24 13-9] 11:5 | 9-2|2°3/67 | 10-5 | 10-5 | 10 72 | 4:8 
184 Pilly 2 a 25°9 | 22 13°5 | 12°5 | 10°5 | 2 afeoulmelalics s ele] LO 25 eae aa 
185 #52 i 24:5 | 22-2 |13 | 10-1] 8-4] 1:7] 64:6] 10-4 | 10:5] 9:4] 7:5 | 4:1 
139. Transitional vertebra, left sacral, right free. Large pelvis. 145. Small pelvis. 146. Flat 
pelvis. 147. Large pelvis. Last lumbar articulates with right wing sacr. 148. Six sac. vert., 1st 
tends toward lumbar type. False promontory. 156. Six sac. vert., last teuds toward coccyx. 
157. Transitional vertebra, right free, left sacralized. 158. Six sac. vert., one from lumbar. False 


* Original number. Specimen not yet permanently catalogued. 
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American Indian Squaws (continued). 


OurLer Innominatré Bonn SAcRUM 
x ee ee ae = P= iy | eee es Oe see 
he zg ei 5 3 = a 4 eS Z A 2 = 3 ® = o g3 “| 
Bee | as | 2. (ee|/ 34] 8 1ee/ 33/38) 2) @] BS lee 3 ea] 3 
30 : So | Oe ie a |'o o o 2 5 = 3075 | 
Seueetoe: | = [em] ee) Aa ler) ee ale | | ge lee] Oo 6S] 8 
Cs) 5 n S 
Ay ay 
4°2 3 M 82 204) 16 20°6 Deo ee nO snl) lle Oe eG M 3 | 108°6 
4°5 Def M Sordi! 925)! 48 7529) | 19:5.) 14-82) == 173-6) 82. | dil-9 ) M 1 | 145°1 
3°38 3 R 94°4) 19°2 | 15:2 | 79-1 | 19°24] 14°5 Se lee Poel Ties 5 M 3 107°3 
od: Dll R 83°2 | 18 13e3 Seis ae eeoaleAD NOEZ5 ile, 5 Ss 4 109°8 
3 3 R 85 18°4 | 13°88 | 75 IRoss53 | [ko33} 225s LO 2) alate D) Ss 1 108°6 
3°3 2°6 M 87°7 | 18:2 | 13°8 — 18°3 | 138°7 | 74°8 | 73:2 | 10°2) 11:1 5 Ss il 108°8 
3 i M 84 Toc he ON 326 76 (AO S21 ORG |b M il 129°3 
Sd) 3 M DSO TES! 13:35 ey als 13°3 | 73:9 | 73:5 | 93 | 11°6 5 Ss 33 124°7 
2°8 Bi) M 83°6 | 20°5 | 16°2 SZ 0e (eG E22 iS eho OullOme | 2235/5 M 3 | 121°7 | 
3°3 2 R 92-1 | 20 14°8 = |) 20:4) | 1465 70 80-3) 126) | 12°F | 6 M 2 100°8 
2°8 meal R 94°3 | 18 13D len LOsOn bs 2a) FORSUsoiellied | LU 96 M 3 95°7 
3°8 2°5 M hat) 2d 9) GEG) 7528 21-6 |) 16s | 79%3 | 10 11-7 | 5/6 M 2 Wily 
oe} 11%} R 85°8 | 18°6 | 14°5 18°7 | 14°8 — | 78°9| 9:4] 11°74 5 M il 1213 
3 2°5 R 95°4 | 20°5 | 15°4 | 75-1 | 20°3 | 15°4 = | 79:4) (952) 129 5 oP il 132°6 
Do 2°4 R 77°3 | 20 AO HOST 15 S| B33) UE Pallas) ‘ M 3 107 °2 
3°2 oul R 80°1 | 19°6 | 14°6 | 74°5 | 19°5 | 14:7 = | 7) relay || NP 5 iP 2 144°7 
Bie 23 R 92°7-| 19:8 | 14:7 a TR als) (Ore | 18 9°6 | 10°3 5 Ss 4 107°3 
3°7 2°9 1a, 79°2 | 20 14°6* | 70°3 | 19°8 | 14°6 81:6} 7:6] 11°9 ‘5 M 2 Ifo Ou 
3°3 27 R 78:9 | 20°62] 15:52] — | 20°6| 15°5 | 75:2 | 79°2|10°4/ 12:3] 5 M 1 TMS 
3 2°6 N 73°9 | 19 A |e LORD Ay Wee) 75-5) | Fae 22 Si 6 M 2, 98°3 
3:25 | 2:9 M 88'4 19-6 | 15 76°5 | 19°6 | 15 — |75:4)1071)11°3] 5 M 1 Es 
S99) ya} M 82°3 | 20°5 | 16°2 20°6 | 16 uneone || \epaitirh |) AMOS) || Key 5 M 33 98-2 
S3°7hay || 78 7/ R 94:4 | 20°3 | 16-4 SS 2Or4 et ny moa eLORs ile, 5 NS) 1h yal} 
3 374 R 97°4| 19-4] 14°8 Oe aml alps On iro Os: (elated: 5 M 2, 107°6 
33045) Bie M Soli 20 One, IES)SS) |) alisyal == ||| yAcel  SN aig hse 5 iP 1 123°1 
D°5 2°9 N 2 20D) 14-30 7078) || 1988 | 14e38 |) = | 74-8 Pa | ed | 6 M 1 100 
3:5 374 M 79°5 | 21-4] 15°58] — QICOMED: bd | ale 8 "7626 lle | ) M 33 106 
3 2°8 N 76°4 | 20 Seow el ZOkS lp len 7 esse fone leo! (ll 8:l) a6 M 8} 99°1 
3°5 2°8 R 82:2 | 19 Toei 792519 14°9 SS 7 TEE Gr Les 6 M il 107°8 
334) 2°9 N 79°6 | 20 14°6 | 73 TOON Ae == iT | LOS 1227 5 M 2 117-6 
Somaya at R 96°2 | 19 OR e747 eSi San Oe SEN al 6 M iL Tell 
Si7hay| WARS) M 87°8 | 20 14°9 =|) ii) Lo Tipe eee |e 5 M 2 LEO ST: 
3°3 2°5 N Slow M7 ares | 74 -Gilnad eloeoel = 70°81) 875} 10k 3) M 3 118°8 
Bb 3°2 M 75°4|17:4| 13°6 | 72-4] 17°3| 18°62] — | 75 926) 18. 5) Ss 4 114°6 
3°2 Q°4 N CUCOM ie ds i 7Oedal lisse sa Ye WG8csal LO 108 | 35 S 4 | 105°9 
Bi) 2°3 R 100 72 ailin te 75e6: | bye29| 138 == | 70a || 8/4 |) ihe i) M 2 | Wi} 
3°6 3°1 N 64°5 19 TsO male lle US 7 We Ae en Ollie C248 ST 5 M | ite3il 
3°6 3} M 87 18 17 eS |) ais sere | afasen a) aise) || aeons aly baal 5 M De L207, 
Bis} oul N 75°4 | 18:9 | 14-6 | 77-2 | 18°99) 14-62] — | 72:7 8°54) 12 i5) M 2 Ta 
3°6 2°3 N 80°4 | 18:4 ]-13°3 | — | 18-7] 13-2 TOON eo co LO es Lol 5) M 1 | 109°9 
3°6 3 N 82°2 | 19-6] 14 | — |19°8} 14°3 | 72°2| 74:3 10°5 | 11°6 5) M 2 LOSS 
3°2 2°9 R 96 18-1 | 13% | 71:8) 17:8 | 138 aloes: (lal: 5 M 2 |) 11873 
3°4 2a R 9671 fare Als} Placa lion me eAcOs ln Acoeleosoal a0 u/ 5 5 3 109 °2 | 
4 2°9 R 89°8 | 18°34] 13°7 SSSA 3e9 psd | aros6 |e TOZDAT 25 M iy alaka3ss} 
4 3} M 95°2 | 19-5] 14-2 Se LOL y el s3e liacon err EEA |) ALY 5) Ss A loa 
3°7 2°6 R 91°9 | 20 Haale oa Loco Las = |r| 5) )) PAL 5 M Il 1301, 
3°5 2327) R JOSH ES: lost lr eel) Ui 336 = 72) 9 ORI" Yb M 3 | 118°9 
promontory. 160. False promontory, Ist sacral has lumbar characteristics. 162. Wide outlet. 
Coccyx ossified to sacrum (not included), 164. Six sac. vert., one from coccyx(?). Small outlet. 
166. Six sac. vert., all sacral. Double promontory, rounded. 167. Six sac. vert., all sacral. 169. Six 
sac. vert., all sacral in character, 174. Flat pelvis. 175. ‘Male arch.” 181. Youth. Sacral 


bodies not quite united, 
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Table of Pelvic Measurements of 
INLET Penyic 
Inter- 7 
A.M.N.H.| ‘Tribe or _— crests |Inter-| 2 | , | 52] o Palit ; E 
N. Y., No. Locality spines} § | 33)28) 2 my cet) 82/28 
ems. Z3 35 g 3 |38 £ iat Se gs 
& |2a/28/ 8 5 |aes| 3 @e|265 
& [Sa /os|a |] ~ |4848/ & a | a 
— ————| | = 
Apache (Arizona) 21° 13°3 | 14°3 | 1175 | 2°8 | 8-9 | 5:2 
Pueblo 2. 14 11:3} 9:5) 1°8 85 | 55 
Arizona 2 13°1] 111} 9:4] 1°7 7 5 
” 21 12°] | 12-4} 10-5 | 1°9 79 | 46 
| ” 22 l4 10:8} 9:2} 1°6 73 | 5:3 
4 22-2 113 | 10-6) 9-2] 1-4} 66 | 63 
S.E. Utah 19 11°6 | 118 | 10°9 | 0-9 G6 | 45 
Eskimo | 21-8 9°: 75) 1:9 8 41 
Brit. Columbia | 24-18 2°3.| 1:3 66 | 7-5 
” 25-4 | 23 er 515 
[Queen Charlo: toa. 20:5 15 71 (|I5 
7 Brit. Columbia | 24-4 19 85 |6 
99/1720 | Fort Rupert, B.C. 19°5 2°3 82 | 5:2 
99/1731 | 2: 13 63 | 6 
99/1727 22 2-4 |b 
99/1741 | 2 17 75 | 5d 
99/1670 } 14 7 47 
99/1674 } 11 8 6 
99/1676 1-4 95 | 61 
Sioux 6:5" | 5-3 
i 6-75 | 46 
» 79 | 48 
Dakota 78 | 44 
| 5 8:8 | 3-75 
Cheyenne 74 | 61 
Arkansas 65 | 58 
oe 75 |" 
Argentina 7 66 
Arkansas 76 | 4:7 
5 Pp? 75 | 45 
264.489 ial a ais 
ci feel! ” 73 | 66 
18 Arizona 56 
ue » 43 
01 ; : tr 
"2 ; 2 : 52 
p *3 5 2°5 10°8* | 11 9-4 4G 
plo ” 28 Il-4 95) 86 52 
we) ” 2-9 11-2 |105| 9 5 
82 bs 35 118 | 105 | 9-7 62 
peel ” 2-9 10°71 |10%} 9-7 41 
me > 2 103/11 | 99 58 
Ale 0 21 10°38 | 11 7 A4 
Cave 3 Rs 39 10°5 | 10:5 | 10 48 
192 A 3:5 Wel | ai | 10-2 47 
52 rn 3 10-4 | 10° | 9-4 41 
139, Transitional vertebra, left sacral, right free. Large pelvis. 145. Small pelvi 146, Flat 
vis, 147. Large pelvis. : Last lumbar articulates with right wing sacr. 148. Six sac. vert., Ist 
tends toward lumbar type. False promontory. | 156. Six sac. vert., last tends toward coccyx. 
157. Transitional vertebra, right free, left sacralized. 158, Six sac. vert., one from lumbar. False 


Coceyx ossified to sacrum (not included). 
166. Six sac. vert., all sacral. Double promontory, rounded, 
sac. vert., all sacral in character, 
bodies not quite united. 


174. Flat pelvis. 


OutLer Tsnouinate, Bone Sacrum 
“ 
ea ie 5 re = 
3 3 , 2 a | as 9 ge) » 
| i6 || BAP || & Bz Sel ta Way) 
g3|2| 34 | 3 2 a | a 2)5e/ & | 28) 8 
me] 3 a = ae es a) ened ag 
| Ss 
43213 M 76% M 3 | 108-6 
45 2-7 M 73" M 1 14571 
BYE) || & R ile M 3 | 10773 
Bt R 75 Ss 4 | 109°8 
3 R 72° Ss 1 108°6. 
33 M 5S 1 | 108-8 
3 M M 1 | 129-3 
35 M 5 3 124-7 
28 M M 3 | 121-7 
33 R 80 M 2 | 100°8 
2°8 R 175 2 M 3 95°7 
[3:8 M 79-3 | 10 oi) M 2 | 117 
33 R 78:9} 9-4 is M 1 | 121°3 
3 v 79:4) 9:2 | 12-2 1p 1 132°6 
33 R 83°3 | 11-1 aS) M 3 | 107-2 
R 79 85 | 12°3 1p 2 | 144-7 
R 78 96 3 >) 4 | 107°3 
R 81:6] 76 9 M 2 | 151-1 
R 79:2 | 104} 12°3 M 1 118-3 
26 N 19°2 744 8 M 2 98°3 
z) M 19°6 3 M 1 | 111°8 
33 M 20°6 Fi M 3 98-2 
2-7 R 204 76 Y S i {alate} 
34 R 19-7 77°8 | 10°6 | 11-4 M 2 | 1076 
33 M | 19-9 78:4) 9° 2 iP 1 123°1 
2-9 N | 4 74:8 | 12:1) 12-1 M 1 | 100 
3-4 M 7 = 76°6| 117 gail TE M 3 | 106 
2°8 N 7 = 78°3 | 11:9 8) 6 M 3 99-1 
2°8 R 82 795 71 16/125] 6 M 1 | 107°8 
29 N 7 73 74:1 | 10°38) 12-7) 5 M 2 |117°6 
2-4 R 9 747 72:4) 9:9 1m |e M 1 | 112-1 
2°8 M 8 = 75:2| 94) 11-2) 5 M 2 |/119-1 
2°5 N 746 70°8| 85 Al); 6 M 3 | 1188 
3-2 M 724 75 9°6 5 s 4 1146 
2-4 N 78:4 — | 68:8} 10:2 781) 15 i) 4 | 105-9 
23 R 756 — |70:5} 9:2 4) 5 M 2 /113 
B31 N ile — }70:1)) 8:4 5 M 2 | 131 
3 M | 75:7 | 73:9 | 9:24) 5 M 2 | 120-7 
31 N — | 72:7] 8:58 5 M 2 | 141-1 
23 N 75°9 | 73°3 | 1071 5 M 1 | 109-9. 
3 N 72°2| 74:3 | 10°5 5 M 2 | 110° 
2-9 R = | 7274) 9:3 5 M 2 | 1183 
2a R 74:6 | 74:3| 9°8 5 s 3 | 109-2 
2°9 R 75°5|73°6| 9 5 M 1 | 113°3 
3 M 73°3|70°1| 9°7 5 8 4 | 123-7 
26 R =) |) 777:2')) 973 5 M 1 | 1301 
22|| 7277/19 5 M By muss) 
promontory, 160. False promontory, Ist sacral has lumbar characteristics. 162. Wide outlet. 


164, Six sac, vert., one from coccyx(?). Small outlet. 
167. Six sac. yert., all sacral. 
“Male arch.” 


169. Six 
Youth. Sacral 
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Tuble of Pelvic Measurements of 


INLET PELVIC 
al Inter- i = a re a x 
+ U.S.N. M. Tribe or crests | Inter- L ra o 2 Q oH "S 3 a = 
| No. Ne. Locality | ems, | SPines S BS = 9 5 4 ae 3 3 3 = g ate 
| a eee ema | ae = lee8l a ee SO aes 
S on | 26 35 A ea Onma eS 3 os | 58 
gq (ayers fas) aA a & a Ay By 
a ay 
186 *2] Arizona 24-7 | 22 | 12-2 | 11-4 9°6 1°8 (see abe7e |) ial S)87/ Wisse |) 2'97/ 
1s7 *]28 .; 25°5 | 22°7 | 12°5 | 10°8 9 1°8 72 10°6 9°5 85 62 5 
ISS *68 of 24 212 =| 12-1 11°8 9°2 2°6 76 12 9°5 86 T1 54 
| 189 *73 * Qik seh 133 11°3 9 2°3 69°2} 11°5 | 11°5 | 10°7 8 4°5 
190 *34 os 24°34 | 21°48 | 11-9 | 13°3 | 11°9 1'4 | 100 12 Gay | SOE || 7 6 
191 *15 26 24 13 iil 88 2°2 67°2 | 11°42 9°52 8'1 igs 4°6 
| 192 *28, a 26°6 | 23 13 9:6 &'1 1°5 61°8| 11-4 | 10°5 D1 Nees 4°2 
| 193 *] - Ae | ORIN BIC EH Onze NG) 17) aera 10:5 Oar. Neer 58 
194 #57 * 26°5 | 24-4 (13 | 11:3 | 94 | 1-9 | 72:3) 10-8 (| 1OsS 0-8 eG omneseD 
195 *1,b 3 24:2 | 20 12°3 | 11-52 | 952) 9 77°2.| 11:74 | 10°5 | 10 63 | 5-6 
196 *64 3 26°2 | 21°5 | 13°2 | 13 10°9 2°] 82°6 | 11°71 | 10 9-2 65 | 5:6 
197 ¥79 * 24° | 20 12:2 | 11°7 |10°2 | 2:5 | 83-6) 10-3) | S-Series Saline omledcs 
A.N.H.M., 
N. Y. 
198 | 99/1677 | Nimpkish, B.C. | 26-9 | 24:2 | 13-4 | 12°6 | 11-2 1:4 +}, °83:65) 1352/5) Olean eee 82 | 6:2 
199 99/1672 ss Yoel W 2328) sal 13°6 | 12°3 1°3 93°9 | 12°6 | 12°5 | 10°8 8:3 574 
200 99/1668 5S 23 19255 L038. 28) | 1°8 | 106°9|11°7 | 11°5 | 10°8 75 55 
201 99/1669 a 26 21°8 | 12°6 | 12:8 | 11-5 1°3 91°3 | 12 115 | 10°6 8 5°6 
202 99/101 Kwakinlt 24°9 | 23 12°9 |11°7 | 10 1‘7 Tefsanl) Wey |) 1 10 9:3 | 4:6 
203 Sapte N. W. Coast 25 D5) Wesel 12 10°5 1:5 80°98) 122 ae 9°6 8 4°8 
204 | 99/1619 | Nanaimo V.C. | 23 21:3 | 12 12°5 | 11 15 91-7 |) 9:5, 7) el 10 7 3°5 
205 99/1620 5 26 24 13°7 | 12:3 | 10 23 ye jail 11 10°5 7:5 o4 
206 99/1623 sf 24 21 eyay | diet |) Ko) 1°4 V4) A225 a2 11 OU o7 
207 99/1625 . 27°8 | 24 14 12°6 | 10°5 2°1 75 13 12°56 | 11°5 85 5°5 
208 99,1626 5 249 | 22°1 13°5 | 12 10-4 16 70 Say ALI 10°2 7 5°3 
209 99/2637 N. Samich, B.C. | 25 21°6 | 12°9 | 11°3 9°8 15 76 12275 eal 10°3 7 6°2 
210 99/1699 7, 25-9 | 22-8 | 13 13°8 | 12 1°8 92:3) |) 12:9) | LS) Osa 2S: 6°3 
211 99/1520 Victoria, B.C. | 26°5* | 228 evil ||P) 10°4 1°6 79°4|11°2 |11°5 | 10°9 a 4°6 
212 99/2632 N. Samich 252 ieee lies L2°2) 1 1°2 84°6 | 12°6 | 10°52 10-4 7°32 6:1 
213 99/2666 55 24 22°4 | 13 10°8 9°5 1:3 73°1 | 13°3 | 12°5 | 10:6 9 i) 
214 | 92,183 | Wallula, Oreg. | 24-9 | 20°8 | 12-9 | 11-4 | 9-9 | 15 | 76-7] 11-1 | 12 10-1 86a 328 
U.8.N. M. | 
215 *60 Arizona QAO 2 11°8 | 11°4 | 10 1:4 84°7/;11°7 | 10 iSitoy ett a 1h 5 
216 *149 35 26 25 13 12 10°3 197 79°2 | 10°5 9°5 8:8) 4 16%5 4°8 
217 *148 . 27 23-5 seed lt 9 2 726107, || 39 8-22| 63 | 5:5 
Average t —- 25°76 | 22°66 | 12°95 | 11°78 | 10°68 1°76 | 79°75| 11°59 | 10°82-|} 9°79) 7-56) 5-21 
Largest } — 29° | 27 14°7 | 149 | 14 3°2 | 107°7 | 14°66 | 14 12°75 | 11°7 75 
Smallest == QOLSE eZ elOs3 9°4 7a) 0°8 61:5 | 9 9 8 6 2°8 
| 


188. ‘“ Male type.” 
somewhat asymmetrical pelvis. 


192. Flat 


promontory 


pelvis. 
: 196. 


189. Sacral canal open all but 0-1 cm. in middle. 
191. Six sac. vert., one from coccyx ? ‘Male type.” Narrow outlet. 

194. Transitional vertebra, left sacral, right lumbar type. 
Six sac. vert., all sacral character. 


Small outlet. 
Six sac. vert., one from coccyx (?). 


* Original number. 


198. 


Specimen not yet permanently catalogued. 


190. Narrow, 


False 


199. Transitional vertebra, free on right, one from coccyx. 
210. Six sac. vert., all sacral. 


vert., all sacral. No false promontory. 
212. Transitional vertebra, right free, but articulates with ilium, left sac. False 


sitional vertebra, left free, right sacralized. False promontory. 


217. Transitional vertebra, right sacralized, left free. No false promontory. 


t Double 


promontory =2. 
Biometrika 1x 


200. Narrow pelvis. 


promontory. 


207. Six sac. 
No false promontory. 


213. Tran- 


216. Six sac. vert., one from coccyx. 


+ False promontory = 18. 
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A. B. Emmons 57 
American Indian Squaws (continued). 
OuTLET Ixnominate Bone Sacrum 
3 = ee 
Ene: i iE 
Beis | 3 Pele ae. yd lias Bg 2a =e Via) esse 9 ae | , 
Begecioee | 2 | 2h (-32/) 2) ee) ee) 8)2+3)/3) 33 z oe las 
Be : mm ey || See gq | 84 |] oH a ¢ 2 & 2 
eee fey ieee ie ja |e ye |B la) ee | | ae | 
Ay ay oO 

3 2°8 M 82:9 18°3 | 18°6 — {18-4 | 13°6 | 73:9} 74:1} 9:1) 10-7 | 5) M 1 117°6 
3 3 M 80°2 |; 18°35 | 14 76°5 | 18°2 | 13°6 — |71°8| 96} 11:3 | 5 M 2 Diver 
2°77 | 31 N 71°7/ 18°6 | 14°2 | 76°3/18°4 | 142 — |77°5} 11 Teal 5 S 3 / 100°9 
35 | 2°8 R 93 Ike}s} |) ks} 82 18°3 | 14°6 — }:67:8 | 9:6) 11:6) 5 M 1 120°8 
28 | 2:2 N HBO OS® ae 71°8}19°3 | 14" | — | 80-2; 9:8} 11-6 5 M 1 118-4 
2°38 | 3°2 N 711.192 | 13°8 — |19°3 | 13°4 | 69-4] 74:2 | 11°14) 11:1 6 M 2 100 
3:3 | 3 N 79°8) 18:7 | 14:4 — | 19 14°6 | 76°4] 71:4} 10 HAL | 5 S 1 111 
35 | 2 M 88°2| 18-1 | 13°8 || SRE Meier risks || 7AM Ili ee) 10°9 5 M 1 101 
2°8 | 3'8 R 95:4] 20°4 | 15 73°5 | 20°3 | 14°8 = | |) BP es 5/6 M 2 111°3 
3-4 | 2°5 M 85°5| 185 | 14°2 | 76°8| 18:4 | 14°3 — |76°4, 9:4) 10-4 5 M 3 110°6 
3.2 | 2°7 M 82°9| 20°2 | 14°6 — |20°7 | 14°6 | 70°5| 79 11:4} 114 6 M 3 100 

3 3 M 80°6| 19°1 | 14:2 | 74°3/ 19°12 | 14°22 | — | 79°6| 9:7] 11°3 5 M 1 116°5 
3:2 | 2°8 M 72 TSH || alae — |19°5 | 15:2 | 77-9 | 72:5) 10-9 | 12°3 6 S 4 112°8 
3°3 | 2°8 R 85°7 | 20°3 | 15°8 — | 20°5 | 161 | 78°5 | 81°7 | 12°3 | 12°3 5/6 M 3 100 
3°8 | 14 M 92°3 | 19 14 4a 187 | V4 — | 82°6| 11°4| 12 5 S 4 105°3 
3°77 | 2°5 M 88°3 | 20°1 | 15°8 — |20°3 | 15°6 | 76:8] 7871) 9:8} 12 5 M 1 122°4 
3°2 | 2°6 R 80 18:3 | 14 — | 183 | 144 | 78°7) 73:5} 91) 11 5 S 1 120°9 
35 | 3°3 N 78°7 | 20°2 | 14°8 | 76:5|19°7 | 14°6 — |80°8| 8-2) 11°3 5) M 1 137°8 
2°8 | 3 R 105°3 | 18°3 | 13°6 — |18°9 | 13°8 | 73:0 | 82:2} 10 11°3 5 M 1 113 
3-4 13 M 95°5 | 20°8 | 14°6 | 70°2/ 20-4 | 14°5 —- | 80 9°8} 11°5 5 M 1 117°3 
S-2 | 2°3 R 88 17°9 | 14:2 | 79°3)17°8 | 14°71 — |746| 9:3} 11°6 5 S 2 124°7 
34 | 2°6 R 88°5 | 20°7 | 15°8 — | 20°8 | 15°8 | 76 74:8 | 11°5 | 12°7 6 M 1 110°4 
33 | 18 R 88°7 | 18°5 | 14°7 18°9 | 14°9 | 78:8 | 75:9 | 10°6 | 12°5 5 M 2 WAEZ/ AS) 
Zo |2°5 M Sis 13°6° 14-9" 80-71 | 182 | 14-1 | — | 74:4] 9°7| 10°3 5 SS) 2 106°2 
3 3 R 81:4 | 2071 | 158%) — | 20°71 | 15°8 | 78°6 | 77°6| 11°6 | 12 6 | S 2 103°4 
2°8 | 2°3 R 97°3 | 19°22 | 14°5 — |19°2 | 14°5® | 75°5 | 72:4) 9:3) 11 5 M 1 126°9 
28 | 21 R 82°5 | 20°84 | 14°82 | — | 20°8 | 14°8 | 71-2] 83:2} 9:1) 11°7 5/6 M 2 128°5 
35 | 2°3 M 79°7| 18:3 | 13°7 | 74:9 | 18:3" | 13°6 — | 76:2] 8-2 | 10°7 6/5 M 3 130°5 
on | 1:8 R 91 18°7 | 14:2 | 765/186 | 14:1 — |73°5] 9:6} 11°6 5 M 1 120°8 
3 2°6 N (226) U8"1 | 13 — |18°6 | 12°8 | 68:8] 76:4} 9°7 | 11 5 S 3 113°4 
24 | 25 M 83°8 | 191 | 14:2 — |19°2 | 14°2 | 74:5 | 73:8 | 10°8 | 11°7 6 M il 108°3 
3 2°4 N 76°6|19°5 | 14:5 | 74:4} 19:4 | 14°7 T2202 0:9 6/5 M 3 106°8 

N-44 4 segs. =3 Ist =90 
3°35 | 2°81 || N-M-5 | 84:26/ 19°27 | 14°52 | 74:6 | 19°32 | 14°5 — | 75:2} 10 115)4/5= 2 $52 |2nd=59/| 115°8 
45 | 4:3 M-92 |108°9 | 22°2 | 16°9 | 84:1} 22°5 | 16°8 — | 90 13°2 | 13°7 5=169| M146 | 8rd=49| 174°6 
eo) | 1°]: M-R-4 | 64 16 WS GES) oh a elds 1 68e7 |e CsA 92) |5/6==-11 P18 4th=16| 86 
R-71 6= 32 /reversed 1] 5th=3 
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‘uble of P. 1 1 : ; si 
il ———= = of RelvioMeusurementsio7 American Indian Squaws (continued). 
Inter - 
Penyio OvuTLET IxnominatE Bone Sacrum 
No. | U.S.N. M Trib Inter: ; 1 aa le =a 5 - 4 
No. .S.N.M. ribe or crests | Inter- 8 o ° a | s == ——— 
onli = es / 28 , Ls} & a a) 2 eS 
td tontity |e [one] € | 32/28 2 | g [88] 2 | 2 | Salee) fee) 2 | 4 Fe aie alcalee salma lesen Wea ii 
a | ry 3 = |23 = = | gr les) 3 aa 42 | 34 s 3 4 2 g 
2 /ei/22| <2 | 2 |ee8) =| 2 |ePl2a| Pale is a2/2/38/38/2/2)2)2) 28) 2 | de] = 
ae | a é | A BA) 3 | & | aa ae fei/3 | 3° aa a | 8 a | A) es | aa ag 
1S6 *. 7 | | | | a ot & va aes 
28 21 Arizona 12°2 | 11-4 Oya Ise | | 
Aol) he » 125 |108 | 9. | 18 Peller nos | ae 3 |28 | M 16 | — |ae4 [136 |739| 741] 91] 10-7) 5 M 1 | 176 
155) 6 308 » 21/118 | 92 | 26 ORES | 82 I 3 (3 M l4 slis2 |136 | — |718| 96/113] 5 Mel ee ng 
189 | #73 7 3. |13 | 9 | 23 HO NEEHS GAL | 6 9-7 \31 | N [ie \ia2 |ve3|ie4 jis | — [7b il ji1) 5 Ss 3 | 1009 
foal) eee ” V9 }133 }1v9 | 14 Pee MT) | ae 3-5 |28 | R | 93 |1 % \s2 |1s3 |146 | — |678) 96} 116) 5 M 1 | 120°8 
eal eras ” 2 re} | att 88 | 2-2 eee) i | © 23 |22 | N 98 lage [is | 71-8lie3 [1 | — |8o2) 98/16)  % M 1 | 118-4 
eal Cae ” d 13) |29:6 3) 821 If 16 See i P| ae Pome Se | oy \irietyaoes, |piassia— ors: 18i)) GBee ire layla) 6 M 2 | 100 
all) Wea » 13:9 |107 | 9 | 1-7 eae |ae }3:3 | 3 EN) etapa eS | = fate | vee ace pment 8 i | ona 
*ly95| #1. ” 13/113 | 9-4 | 1-9 tool tees leneealee 35 |2 | eS las isa) =) 18:8) | E86 | 7ae7)|\7Ots) Gels | 108) M 1 | 101 
Tal fae » }123 |11se| ose] 2 | bree fenced |. fos |38 | RB | 95-4] 20-4 [15 | 735) 208 | 148 77 | 9:7|10°8| 5/6 M 2 | 1113 
ior | 79 » 5/132 |13— |10-9 | 21 be Ieee | 56 Bee eal ete), asroi|p1a;5 | 14am 76:8)| 18; Mais tS rer 5 M 3 | 1106 
sp (a D ; 12:2 |11-7 |10-2 | 15 pial eeeall 56 32 |27 | M | 829/202 |146 20-7 |14°6 |70:5|79 |1l4| 14) 6 M 3 | 100 
ALN. HM, Se 43 3. (3 M | 80-6| 191 | 142 | — |79-6| 97/113] 5 M 1 | 1165 
N. Y. 
99/1677 : eae il 
an 134126 | 12 
oo/18Te Bl [136 | 123 ue 95 | 82 | 62 aloe | m | ve |a93 [152 | — | 19% | 16-2 | 77-91 72°5) 108 12:3) 6 s 4 | 1128 
SU/NGRS ‘3 [128 | 125 | 108 | 8°3 | 5:4 3 |o8 | R 7/903 [108 | — | 20% | lel | 785 81-7|12°3|123) 5/6 M 3 | 100 
99/1669 Li. 126 | 19:8 | 11:5 115 | 108 | 75 | 55 ‘8 | 14 M 923/19 | 141 | 74:2) 18-7 | 14 — | 82°6| 11-4] 12 5 s 4 | 1053 
oto} Kwakinlt | 24-9 |23 | 12-9 | 117 | 10 115 /106 | 8 | 56 Sloe | a | aa3|201 |158 | — [203 |15%6 | 768) 781) 28 12 5 M 1 | 1292-4 
Psa No Ves Comstrg | 255s 1 215) HBG Lea H0;5 ey | 93 | 46 32 |26 | R go | 18:3 | 14 — |1483 | 14:4 | 78:7 | 73°5| 91) 11 5 s 1 | 120°9 
99/1619 | Nanaimo Y.C. | 23 125 | 11 | 96) 8 | 48 35 [33 | N ra-7 (20-2 | 148 |76-5|19°7 [146 | — |808) 82/113) 5 M 1 | 137°8 
99/1620 3 3% | 13-7 | 123 | 10 he WM Ne || ot 23 |3 | y05-3 {18-3 [13-6 | — |18-9 | 138 |730| 822/10 )113)  } M i lane 
aaitece 3 135 | 11-4 | 10 ue 10% | 75 | 54 34 |3 M 95°5|20°8 |14°6 | 70-2) 20:4 | 145 | — | 80 9°8 | 115 5 M 1 1173 
oie ” 126 | 103 Lae eat || BPG 32 23 | R polar lite |793[178 |141 | — [746] 93) 116) 5 $ 2 | 124-7 
gojsesy | N-Sami 135/12 | 10-4 125 | 115 | 85 | 5:5 a7 loa | Rp | ee-5|20-7 | 158 | — |20°8 | 15° | 76 | 748) 105 27| 6 M 1 | 10-4 
5/2687 |/.N; Samich, B/C: 129 | 113 | 9:8 11/102 | 7 | 58 33 118 | RB | 887 | 185 | 147 13.9 |14:9 | 78-8|75-9|10°6|12°5| 5 M 2 |1179 
pares apy tpl ar 2 E 13-8 | 12 NE Oe 62 25 | 25 M Bi1 {186 | 14:9 |80-1|182 | 141 | — | 744) 97 10°3 5 Ss 2 | 106-2 
nalaeae Victoria, B.C. | 26-5 13:1 }12 | 10-4 | 115 | 105" | 8 | 63 a IS R gi-4| 20°1 | 15°88 201 | 15-8 | 78:6 | 77-6 | 11-6 | 12 6 Ss 2 | 103-4 
BS/2682 5) Des Gamichy 9) 3 |122 /11 | 15 |109 | 7 | 46 3 a3 | mp | a7ali9a|145 | — | 192 | 14° | 75:5] 72-4) 93) TT 5 M 1 | 126-9 
Fee ama 10°83 | 9:5 10:5" | 10-4 | 7-2 | 61 ag ja1 | R : — |a0-8 |14-8 | 71:2|83-2| 91/17) 5/6 M 2 | 128-5 
B22 ASS | iallala sOree 24 12-9 | 11-4 | 9-9 125 |106 | 9° 15 35 |23 | M 74-9| 18-3" |13°6 | — | 762] 82)107) 6/5 M 3 | 180-5 
U.S.N.M. 12 | 101 | 88 | 38 31 |18 | RB 75 (186 |14:1 | — |73°5| 96/116] 5 M 1 | 120°8 
215 *60 Arizona 
216 zone 118 | 11+ Sealer | 
eiAllxtias » Fey lee peters cereale eel Sa ag | nw | ra6\1e1 |13_ | — |18°6 | 128 688) 76-4) 97) 1 5 s 3 | 1s 
” 12-4 | 11 9 2 726 sta 95 | 88 | 65 | 4:8 4 | 25 M 838/191 | 14:2 | — | 19:2 | 14:2 | 74:5 | 73:8 10:8 | 11:7 M 1 108°3 
= zy 107 | 9 Oe 63 | 55 a4 N 76.6|195 14:5 |74-4| 19-4 [14-7 | — | 72°2| 10-2) 10:9 6/5 M 3 | 1068 
| = = 
Averaget 25-76 N-44 4 segs. =3 1st =90 
; = 25°76 | 22-66 | 12°95 11-78 | 10-6 Seri llorqaee E | 
Largesttt ai (kaa 10;68)1)-2°76)/(-79775) bo} 1.0,82) 8572) a oo aes 3-95 (2-81 || W-as | 84-26| 19-27 | 14:52 | 746 | 19-32 | 14-9 | — | 752/10 | 115 4s= 2) $52 |2nd—s9 1158 
ike = 291 |27 |14-7 | 14:9 avs ore} ; | | 
Smallest 20°8 14 | 3-2 |1o7-7]1466}14 | 12-75] 11-7 | 75 a5 \43 |X x0-92 1089 | 22-2 |16-9 |et1|225 |1¢8 | — |90 | 132) 137 5=169| M146 | 3rd=49| 174°6 
; =a 20°8 |17-2 |10:3 | 94 | 75 
W5" | 10:8!) 6175)), 19) 19) | Bi | Mem ere ge (i Watered) ee lie 21-3. |\e7-8)] 1677 | 1145) — |/63:7)))9673))) 9:2 5/6= 11| P18 |4th=16) 86 
es = R-71 G= 32 \reversed 1) 5th=3 
188. “Male type.” 189. S - u 
Ad . Sacral canal } : : A 
somewhat asym 3 vi se open all but 0-1 em. in id ; 
192, Flat SHCA pORaIcE 181. Six sac. vert., one from coccyx 7 cates ty 0.” Ae: Narrow, 199. ‘Transitional vertebra, free on right, one from coccyx. 200. Narrow pelvis. 207. Six sac. 
promontory. TOGIRISRKIEReteerE ven j_rensitional vertebra, left sacral, right uaabae ieee outlets vert., all sacral. No false promontory. 210. Six sac. vert., all sacral. No false promontory. 
iene rom coccyx (2). YO 212, Transitional vertebra, right free, but articulates with ilium, left sac. bee promontory. 7 213. Tran- 
ix sac. vert., one from coccyx. 


sitional vertebra, left free, right sacralized.  F ‘alse promontory. 216. 
217, Transitional vertebra, right sacralized, left free. No false promontory. + False promontory =18. 


{ Double promontory =2. 
Biometrika 1x 


ee 198. Six sac. vert., all sacral character. 
riginal number. Specimen not yet permanently catalogued. 
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THE INTENSITY OF NATURAL SELECTION 
IN MAN. 


SECOND PAPER. 
By E. C. SNOW, M.A., D.Sc. 


THE present paper is a supplement to the memoir of the same title issued last 
year*, It is not proposed to give here any account of the work which has been 
done in the attempt to elicit information on the more difficult subject of the nature 
of selection in man, but only to publish the correlations and regressions obtained 
by using an alternative measure of environment, and by varying the periods in 
which the effects of a selective death-rate can be detected. 


The adoption of another method of correcting for environment implies not the 
slightest shaking of my confidence in the adequacy and validity of that employed in 
the first memoir, but the further work was entered upon because the importance of 
the subject renders the comparison of the results reached by the use of the various 
possible methods particularly desirable. The only criticism I have seen of the 
mode of measuring environment used in the earlier work is that by the Editors of 
the Journal of the Royal Statistical Society. Had there been a tithe of evidence 
supporting the view adumbrated in that criticism the memoir would have been 
practically valueless. Fortunately, however, no arguments whatever have been 
put forward in favour of the view held by the statistical critics, and very cogent 
facts against that view have already been givent. It is quite beside the point to 
show that the corrected standard deviation of the total mortality in the two 
periods considered is only 5°/, or 6°/,. That standard deviation is in a number 
of cases appreciably of the same magnitude as the corresponding measure of 
dispersion in the earlier of the periods used, and, moreover, is many times its 
probable error. It matters not whether that standard deviation be 6°/, or 06°/, 
or 60°/, of the mean value. 


* Drapers’ Company Research Memoirs, ‘* Studies in National Deterioration,” No. VII. Dulau 
and Co. 1911. 


+ Biometrika, Vol. vit. p. 456, 1912. 
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The notions used to indicate environmental conditions for this new work are 
very simple. Shortly, they consist in making the mortality of a cohort of one sex 
the measure of the environment for the corresponding cohort of the opposite sex 
on which it is desired to ascertain the possible effects of selection. Thus if we 
wish to investigate the selective effect on the male mortality of the third, fourth 
and fifth years of life of variations in that mortality in the first two years, in 
addition to fixing the size of the male cohort we fix the size of the corresponding 
female cohort and also the total female mortality in the first five years of life. We 
can thus suppose that we are dealing with districts in which the female mortalities 
up to five years of age for the cohorts born in a particular year are the same. For 
these districts we find varying male mortalities in each of the periods considered 
(see Table below), and the mean values of these male mortalities in both periods 
throughout the whole series of districts can be found. Do the mortalities in the 
second period of those districts whose mortalities in the first period deviated in 
the positive direction deviate, on the average, in the positive or negative direction ? 
Districts with the same female environment will possess varying proportions of 
male weaklings. If these weaklings are killed off in the earlier period, the popula- 
tion which survives to the later one is stronger and likely, therefore, to have a 
smaller mortality, and this would be indicated by a negative correlation between 
the mortalities in the two periods (with the proviso dealt with in § xxxtv of the 
memoir). To the criticism that the total male mortality is highly correlated with 
the total female mortality, and that by making the latter constant we are 
practically fixing the former, we can reply by pointing to the considerable 
standard deviation of the total male mortality when correction is made for 
constant female mortality (see Table below). Evidence of a more general 
character, too, can be gathered by turning over the leaves of any of the Registrar- 
General’s valuable Decennial Supplements to his Annual Reports. Pick out a 
few of the registration districts in which the mortality of one sex for any of the 
age-groups given is practically the same and compare the mortalities of the other 
sex among those districts for the same age-group. Quite appreciable variation in 
the numbers will be found to exist. Not many of such districts can actually be 
found, but the method of partial correlation essentially consists of a contrivance 
by which we can for statistical purposes reduce all districts to a constant type. 


No method of measuring environment can be theoretically perfect. Districts 
under the same environmental conditions would not have the same mortality, but 
the latter would be distributed in some way due to random causes. Of two districts 
under the same general environment one may in a particular year suffer to a 
greater extent from epidemics of measles, scarlet fever or summer diarrhoea, and 
part of the problem of selection consists in ascertaining if these epidemics strike 
more at the weaker children than at a random sample of all children, and this is 
ascertained by inquiring if the surviving population is more immune in a subsequent 
period. The two methods which are employed in this paper to measure environ- 
ment are quite distinct and bear no physical relation to each other. In the one 
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method the mortality of the cohort corresponding to the cohort whose history is 
being traced but of the opposite sex, is fixed. In the other, the number of deaths 
of the same sex and between the same age limits in the period under notice, 
apart from the deaths occurring within the cohort, is rendered constant. Neither 
of these, of course, allows for the fact that under a perfectly uniform environment 
the mortality which is taken to indicate that environment should be distributed in 
some way due to random causes. But the general similarity which will be shown 
to exist between the results reached by using the two distinct methods is some 
justification for the claim that each is a satisfactory approximation to the theo- 
retically best method. 


The investigation which is now being described was directed throughout to 
ascertaining the extent of selection in the mortality of the first two years of life. 
In the earlier work of which an account is given in the memoir other periods were 
taken, but later some evidence was adduced to indicate that the first two years 
of life was a natural interval to adopt, as embracing roughly the whole of the 
mortality of infancy and overlapping but very little that of childhood. For the 
second period, on the mortality of which the selective character of that of the 
earlier period is indicated, the next three years of life are taken both for the 
English and Prussian data; in the case of the latter, also, the next eight years are 
employed as a second period. We thus reach results obtained from English and 
Prussian data by working at the same periods in each case, and the comparison of 
these results is of interest. The notation employed throughout is: 


x) = Births of the male or female cohort considered, in, say, year t. 
x, = Deaths in the cohort in the two years, ¢ and ¢+1. 
#,= Deaths in the cohort in the next three years, or next eight years. 


x, = Remaining deaths of same sex as cohort in the five years or ten years (see 
Memoir, § vit). 


w,= Deaths in the corresponding cohort of opposite sex in the five years or ten 
years considered, 


a; = Births of the corresponding cohort of opposite sex. 


Previous experience suggested that the correction for a constant value of a; in 
addition to constant values of 2 and «, would have little effect on the correlation 
between a, and a. The first case worked out supported this view; in that (the 
Prussian male cohort of 1881, dealing with the first ten years of life) the correlation 
was only altered in the sixth figure by the extra correction, viz. from — ‘944206 to 
—'944209. Thus the considerable labour involved in making this further correc- 
tion is not justified by the extra value it gives to the results, and in all the other 
cases it was omitted, 


The partial standard deviations, correlations and regressions with their probable 
errors for the various sets of data are given below, the other standard deviations 
and correlations on which they are based being shown at the end. In the table 


61 


EK. C. Snow 


+691-T— | 


9¢0- +496-— | 6&0- 
(211. + = 
| 9FO- - 
LLO- pon 
| & 96 | 
| F- 86 | 
| or I6L | 
| 9g. PSL 
Se: FI | 


€90- +6L6-- | G10. +8LL-. - 

+1Z6-—-— |€10- +929. —- 

6G0. — 

990. +FT89.— 

. 160. — 

cho. +16L--— | 810. +126. — 

6-3 + 98-101 O-OLF GEIL | 

€-8 F 0.66 €9 F 9-ZL 

8-EL+ 6-8CI SELF SSSI 

9-GLF  6-CCT 6-014 9-FZI 

6-OL+ 9-61 9-1IL+ 0-€&T | 

8-LL+ 6-FST 8-FL+ - 1-691 
cog OLF 
FOOS FIGL 


610. FEIT. — 
| $CO. +ZL0- — 

690- — 
611. +ZLT- — 

G60: — 
| O@I- +OQ9T- — 
GE €.96 


Le+ Lage 
GI+ 6.91 
| €.14 61 
GG QS 
| O€+ GFE 
eq 
| LEP 


60l- +TSTI-—- 80. 
660. +LLG-— | FFO- 


| 


170. — 
OG. PPL - 
890. — 
160: +60G- — 
LG G0 | 
GZ 9.8z 
6-14 1-62] 
91+ 0-81 
G+ FFG 
6-GF GEE 
86 
€¢ 


OOl- 
6-G+ GEE 
CE 
2-06 
F-I+ 0-91 
9-6+ 
GEF 
F6 
C8P 


F-9€ 


8.63 | 


Oyo) T8sT 


WOYOD [FRET WIOYOD TS8T 


OOD GLET 


q1OYOR) TL8T 


IOYOD OLET 


(suvaX QT) Vavq NVISsoug 


(suvax ¢) yavq NvIssaug 


VIVQ HSIIONG 


‘SayDUa 
v10- +189-— |O10- ¥669-— |F10- +6E9-— |ZI0- +FG9.— | ILO. +E0-— | G20. FILT-— | 90. Fezz-— | Fah 
SOI. +ZP8--— | 810. +FL86-—- | 190. +FL1G8-- | FIO. +ZEL-— | PFO. +H80-—- | O€O- FIPT-— | 8E0- FOZZ-— Heqge 
68. 6EL-= SOL. 860. — | a4 POL. — 890. — GlgtO 
610. +0Z6-- | €1O. +PPE-— | OZO. FLI6-— | 110. FLZ-— | EZI- FECO-— | 960. FLOF-— | GOL. FGTP.— | “ale 
LLO-— | 960. — 690. — 480. — | 60L- — 261. — 880. — | 2tae0 
810. +G09-— | L10- +¥8%6-— | LFO- +F98L-—- |810- +B6-— | LIL. +FZZ-— | L60. F9OGP-— | OGO. FIZG-— | tae 
G6 + 9-901) 88 £ 9-001) 9-6 # G-OIT F-8 + ¢.96 O-2+ 6FE| Ler 6aE LG  G.T@ | &* toro 
GFI+ 0-991] 69 + L8L | 86 F LIIT| O04 ¥- 6.08 cee 0.07| Ger 9.6@] Lee e.cg | st Lot0 
LGL+ F-081 ¢.91+ 4-681 LE€L+ G-LE1 PEL G.PGL| 9-1TF OSL] &IF BFL! 9-14 G-QT| S07 
6-L1+ F-906| F-81F 8-O1G| FEL LOLT 6-€L+ 9.6¢T ss thee ates ISL GIF LT | 208 
€-1GF LS E+ 1-9GZ L614 9.G%Z I-614 6-812 LG 6-08 PEF G-68 O-€F PE | bof 
631+ F-8hL| ZLIF O-86L| PIF 6-191] ¢-L4TF 1-103) ¢-€F 9-0F| 98EF LPP] 9.84 LTP] 108 
GEL 61 FIG OsF ZI I0L 16 ey 
PCVG OLGG TEVG OLZS BEG G9G 869 My 
WOOD GB8T yoyo) T8st HOO) G88T yt0YOD T88T | SERS) GLST qtOTOD TL8T WOOD OLET 
(suvaX OT) YIYq Nvissaug (suvax G) vavq Nvissaug VLIVQ HSIION 


*SO]D YT 


62 The Intensity of Natural Selection in Man 


on p. 61, .;0,4, denotes the partial standard deviation of the total mortality in the 
sum of the periods considered, .3¢. denotes the expected (partial) correlation if 
there were no selection (see § XxxIv of the memoir), and ,;b,, denotes the (partial) 
regression of the mortality of the second period on that of the first. 


Dealing first with the results from the English data, we notice that the regres- 
sions by the two methods for males are fairly similar, but for females they are, on 
the whole, smaller by the new method. Having regard to the probable errors we 
can draw no inferences concerning the differences. The correlations for females, 
however, are in two of the three cases considerably smaller by the new measure of 
environment, and this difference appears to be significant. The partial standard 
deviations by the two methods occasionally show fairly large differences, but in no 
single case is the disagreement significant. For all the six cases, however, .o is 
less than ,;0, but go, is greater than ,;0,. The mean of the male regressions by 
the first method of measuring environment is —‘142 and by the other —°148, the 
corresponding figures for females being —‘179 and —‘117. The mean regressions 
of the mortality of the 4th and 5th years of life on that of the first three years 
are —'085 and —‘172 respectively*. Thus, so far as males are concerned, the 
intensity of selection appears greater when measured by the regression of the 
mortality of the 3rd, 4th and 5th years on that of the first two years of life than 
when measured by the regression of the mortality of the 4th and 5th over the first 
three years, but the same conclusion does not so definitely hold for females. It may 
be, as was suggested in the memoir, that the age division between infant and child 
mortality is not the same for females as for males, and the inference is put forward 
tentatively that the ailments of infancy (as distinct from those of childhood) 
attack females to a rather greater age than they do males. It will be noticed, too, 
that the regressions and correlations for the 1872 cohort are smaller than for the 
other cohorts, and that this is accompanied by the fact that the mortality of that 
cohort was smaller in the first period. On the whole, so far as the data for the 
English rural districts are concerned, the adoption of a new measure of environ- 
ment leads to no alteration of view as regards the existence of selection, nor, 
roughly, of its numerical intensity. 

When we turn to the results from the Prussian data in which the same periods 
(the first two years and the next three) are used as for the English data, the 
most marked feature to be noticed is the considerably larger correlations and 
regressions which are obtained. The mean value of the regressions for males 


* I take here the opportunity of correcting a mistake which occurred in connection with the work of 
the first memoir, through an error in transcribing from the schedules containing the raw data. In 
§ xvi, in the portion of the Table for the 1872 cohort referring to females, the following alterations 


should be made : 
TAGs Oy NO, 


791 = '812328, r= “718137, 3 944379, 3ror = + 464859, 32> - -168670, 103712 = — 184068. 
These entail the following corrections in the Table on p. 33 for the same cohort: o30,=34'844 and 
partial regression= —-0591. These alterations reduce the correlation and regression but make the 


results more consistent, and necessitate little modification of the conclusions drawn from them. 
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is — "794 by the first method and —°647 by the second, while for females the 
figures are — “798 and — ‘878, these comparing with corresponding values for the 
English districts ranging from —‘12 to —‘18. Thus the criterion which we take 
as the measure of the intensity of selection was for the Prussian cohorts of 1881 
and 1882 about five or six times as large as that for the English cohorts of 1870, 
1871 and 1872. We can assert with some confidence a considerably greater 
selective effect of the mortality of the first two years of life on that of the next 
three in the case of Prussian rural districts than of the English rural districts in 
the epochs considered, and this fact is concomitant with a far greater stringency of 
infantile conditions in the former than in the latter. This is seen from the follow- 
ing figures. 


Mean Number of Deaths in | 
Mean Number of Deaths in | the next Three Years divided 
First Two Years divided by | by Mean Number of Births 
Mean Number of Births minus Deaths in First Two 
Years 
Male Female Male Female 
1870 "185 157 ‘037 036 
England } 1871 174 146 ‘038 ‘037 
( 1872 164 "139 ‘O41 041 
Prussia | 1881 241 214 069 (*102) | -067 (102) 
7 1882 ‘260 "228 ‘075 (°107) 074 (107) 


The figures in brackets give the corresponding numbers for Prussian districts 
for the mortality in the eight years following the first two. 


The data for English rural districts do not allow us satisfactorily to follow the 
cohorts beyond their first five years of life, so that we cannot assert that the 
intensity of selection 1s generally less in England than in Prussia for the popula- 
tions considered, but we can point out definitely that the effect of selection in the 
jirst five years of life was much greater in the latter country than in the former. 
Whether or not the English cohorts make up the leeway at later ages can only be 
a matter of speculation. We can at present merely state that whereas a district 
in England which had an excess of 100 male survivors above the mean for all 
districts at the end of the first two years of life had, on the average, about 14 of 
these survivors killed off in the next three years, a similar district in Prussia lost 
more than 70 of the 100 in the same period. 


When, for the Prussian data, we come to deal with the results of including the 
eight years following the first two, we find that the regressions for the 1881 cohort 
have increased appreciably, but those for the 1882 cohort have not done so. There 
is nothing incongruous in this, as in one cohort selection might well be felt more 
in the 8rd, 4th and 5th years than in the other, and in this latter the effect would 
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then probably come in later years. The tendency of the partial standard devia- 
tions for the Prussian data is opposite to that for the English, viz. for males co, 
is greater than ,o, and go, 1s less than ,3;0,. For females, however, these are 
reversed, except for the 1882 cohort in the ten-year period. Another feature of 
the Prussian results is that the regressions for males are smaller by the new 
method of measuring environment, but for females the reverse is the case. This 
is also true for four out of the six examples from English data, and arises chiefly 
from the differences in the male and female variability in mortality in the second 
period compared with the first. In the first two years of life the (partial) standard 
deviation for males is always greater than for females, but for the second period 
the female (partial) standard deviation is in some cases the larger, the mean 
mortality in this second period being about the same for the two sexes. 


In the memoir (§ xxIv) a short discussion is given to the question of what 
amount of correlation between the mortalities should be expected if selection were 
entirely absent. This has been referred to as ,@. and ,@,. in the present paper. 
These values are only intended as approximations, and it would undoubtedly be an 
advantage if by direct correlations we could obviate the use of such corrections. 
These direct values could be obtained by correlating the mortality rate of the first 
two years of life (based on the number of births) of the cohort with the mortality 
rate in the 8rd, 4th and 5th (or 3rd to 10th) years of life (based on the number of 
survivors to the age of two), correction being made in some manner for a constant 


environment rate. ‘This would entail a correlation between such variables as — 
y 


and 


i and in my opinion might involve an element of ‘spurious’ correlation, 
xv 4 


and for this reason alone rates were not used in the memoir. So far as I can 
understand, however, the critics of the memoir do not hold this opinion, so that to 
them the corrected correlation between two variates of the above type is probably 
as satisfactory as a partial correlation of the third order. The employment of such 
correlations saves considerable labour and requires no discussion of the question 
of ‘expected’ correlation if selection were inoperative. Accordingly, for a few 
cases, the following new variables have been taken: 


2, = Male or Female Deaths in the first two years of life divided by Male or Female 
Births, 
z,= Male or Female Deaths in next three (or eight) years of life divided by number 


of survivors to the age of two, 


z,= Total Female or Male (i.e. of opposite sex to z, and z,) Deaths in the whole 
five (or ten) years divided by Female or Male Births, 


and the values of .7%, worked out. The statistical constants on which they are 
based are given below: 
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PrusstaAn Rurat Disrricrs ror 1882 
Enexisy Rurau Districts | _ E = : 
For 1870 
Five Years Ten Years 
Male Female Male Female Male Female 
| 
Zp 184 “156 241 *209 ‘OAT *209 
Ay 037 ‘036 ‘069 ‘069 ‘100 "103 
Zo "187 ‘214 264 292, “290 coll 
oo ‘0201 ‘0184 ‘0541 “0508 ‘0541 *0508 
onl ‘0079 “0092 0263 "0241 ‘O291 0253 
a 0232 “0219 ‘O611 0653 ‘0595 -0647 
To1 271543 °463227 634918 633825 ‘596721 =| ‘576631 
102 *854400 877328 ‘951975 “975109 937904 | 967319 
119, "528884 °567130 "817686 *755164 °811763 | °723951 
Foi — ‘409 — 087 — 814 “705 — 813 Od 
200 ‘0104 ‘0088 ‘0166 ‘0113 ‘0188 0129: 
971 ‘0067 | ‘0076 ‘O151 ‘0158 ‘0170 ‘O1L74 
2D40 — ‘262 — ‘075 — "744 ‘990 | — 734 — ‘958 


If the values of .7, here are compared with the values of gry, in the earlier 
tables for the corresponding cases we find very little difference. Thus : 


Correlations obtained without using Rates, 


. —415 -—'158 -—‘917 -—-768 -—-920 —-789 
Corresponding Correlations by use of Rates, 
—'409 —087 -— 814 -—'705 -—'813 -—-707 


If we reduce the figures in the first of these by the values of 4¢. we have the 
sequence 
— 347 -—105 -—-809 -—-710 —-781 —-701 
The agreement between the last two lines of figures is surprising and remarkable. 
In all cases we get approximately the same numbers as before, a quite unexpected 
result, 


Lest these results should hastily be pointed to as evidence against the possi- 


bility of spurious correlation arising when such variables as — and 


are 


correlated, it should be pointed out that the deviations from the mean values of 
the variables are in some cases considerable, and that the third and fourth powers 
of those deviations cannot be neglected in comparison with the corresponding 
powers of the means. Thus the formula which is usually exhibited to show the 
possibility of spurious correlation does not apply to this case. Whatever the 
magnitude of the ‘spurious’ element, if any, involved in the correlations just found, 
they can only be construed as supporting those previously found and as evidence of 
the existence of selective mortality in the populations dealt with. 


Professor Pearson, also, has pointed out to me another possible mode of 
attacking the problem. This is to render constant 7,—«, instead of 2, and in 
Biometrika 1x 9 
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addition to #, or #,, as before. This would fix, not the number of births, but the 
number of survivors at the end of the first two years. Then the population liable 
to mortality in the second period would be the same for all districts, but the 
districts with the larger number of births would be exposed to the possibility 
of greater mortality in the first period, and therefore to the possibility of greater 
mortality of the kind which is taken to measure environment (the mortality in 
the first two years being much greater than in the next three or next eight). 
Thus an ‘expected’ (negative) correlation if selection were absent would not arise 
in the same manner as before, but would probably be entailed in the partial 
correlation. A priori, however, it does not appear that this method would pro- 
duce a correlation, if selection were inoperative, of the same magnitude as that 
indicated in § xxiv of the memo1, since the population rendered constant stands 
in an intermediate position to those at the beginning and end of the periods 
considered. The correlations under these new conditions have been worked out 
only for the case of the 1884 Prussian cohort, both male and female. The results 
differ but very little from those reached before, and must, I think, be taken 
as supporting the substantial accuracy of the interpretation put upon the earlier 
ones. If #; denote a — a, the following are the additional correlations: 


| Males | Females 
Os 3117°6 31751 : 
len ceepanedl “924739 | ‘928746 
rn | "847138 ‘851981 
153 | ‘924711 | ‘942791 
54 “927020 | ‘934079 
aie | ss908 | ee yay 
ene ~ -780 — -806 


and —°768 and —‘797 for females. It appears therefore to be of little account 
whether we make a, or «,—, measure the size of the populations; we should 
probably, too, get similar results if we put #) — #,—@», i.e. the size of the cohorts at 
the end of our survey, constant. 


On the whole, the work of which this paper gives a short account has justified 
itself by the confirmation and emphasis it gives to the results previously obtained. 
The general impression received by a study of the results reached by the employ- 
ment of the new method of measuring environment alone is much the same as 
that derived from a survey of those by the earlier one, though individual differences 
of appreciable magnitude occur. Apart from the emphasis it gives to the results 
of the memoir, the present work has discovered, I think, a significant ditference in 
the operation of selection on the mortality of the first five years of life in Prussian 
and in English rural districts, and suggests (but, at present, no more than suggests) 
that there is some differentiation in its effect upon the two sexes. But the 
existence of a selective death-rate in the general populations dealt with admits of 
no doubt. 
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List oF STATISTICAL CONSTANTS ON WHICH THE PARTIAL CORRELATIONS 
AND REGRESSIONS ARE BASED. 


English Rural Districts. 


1870 Conorr 1871 Conort 1872 Conorr 
Males | Females | Males Females Males Females 
| eee | Bese: co! a 
Xp S227 3090 3226 3114 3291 3150 
ais 2669 2240 2635 2205 2634 2189 
Vy 579 694 550 663 548 650 
ov 488°3 467°8 506°6 480°8 520°3 | 477°0 
o1 127°7 105°6 130°0 102°6 126°0 107°4 
v9 27°5 30°6 231 28°4 330 31°3 
03 581°0 508°0 582°3 527°2 605°9 545°1 
o4 128°6 146°1 120°7 145°8 13355 153°8 
or “884086 *856073 | *850467 *834253 *793680 *814537 
702 637792 643144 *609366 5134388 OFAC 3353 662323 
103 "852236 “881001 835223 "797029 *807817 "752425 
704 *849348 *881602 *838168 *836549 °824623 | °777569 
12 °608255 "688624. 638919 "581274 "804386 | ‘791884 
113 *930579 "925657 931412 “934695 "945229 | "932208 
14 *952680 "958944 ‘947168 ‘970562 °969370 | *965413 
193 °775362 *845435 “799258 *757490 "886385 | °874839 
794 "706226 | °749785 ‘77655] °627105 *838194 *842120 
301 +°475208 |+°412648 |+°362379 |4+°415864 (+°156490 + °475066 
01 +:467000 |+:°079683) |4+°323442 |+°169218 |—-040919 +°389982 
3702 — ‘069637 |—:199985 |—:176081 |—°2290385 |+:°000352 + °012762 
402 +°101579 = |—°057213 |—°120807 | -—°026164 |+°080835 = + °022190 
sl12 — "490155 |—-464969 |—°+482395 |—°'546204 | —+221368 = *135040 
412 —°299955 |—:'161886 |—°478055 |—:145896 |—:060726 | -+150136 
: | 
Prussian Rural Districts (Five-year period). 
| 
1881 CoHorr 1882 Conorr 
Males Females , Males Females 
Xo 9407 8917 9297 8793 
#3 11723 10083 12047 103863 
Ha 2384 2760 2510 2938 
om 4586°9 4398°1 4689°5 4504°8 
onl 1502°2 1276'4 1654-1 | Nepal 
v2 343°2 o24°2 375°6 | 360°7 
3 7708 '1 6734°5 8044°8 6997°8 
o4 1558°6 1794°5 G2, 1981°5 
To1 "964196 968495 ‘967685 *965286 
To2 *844586 *846971 “860460 | "869316 
103 ‘964747 “961005 967523 ‘967466 
Toa *963378 “9TO151 "9638954 "973849 
12 *817746 "844333 "841894 *864055 
113 "990456 “Q9OLOL *995000 "995279 
Ty, | ‘988301 994536 "989235 "995822 
193 *884689 903554 ‘8819388 ‘9OLTII 
oq °891353 *882250 ‘906736 “897541 
gn ~=—S«|:«Ss + °238636 + 308402 + 198035 + 097214 
ao =| «= + 295597 — *053527 4362344 | —-216609 
372 — ‘072672 — °196905 + 060222 — 027966 
a2 — ‘116186 — 078351 — ‘121140 ees “047446 
"19 — 910479 — ‘919631 | —'756884 — *795986 
4719 — 913825 — 673466 — *892589 | = *738506 
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Prussian Rural Districts (Ten-year period). 


1881 Conorr 1882 Conort 
Males Females Males Females 
| a = 

io 9407 8917 9297 8793 

i 28277 24650 28137 24480 

te 2625 3000 2733 3156 
a 4586°9 4398°'1 4689°5 4504°8 
C1 1502°2 1276°4 16541 1397°1 
o2 461°0 445-0 470°7 449°9 
3 18056°8 15723°8 18015°8 156452 
o4 1672°8 1901°3 1806°8 2073°1 
Tor *966797 °963495 *967684 965285 
709 °867956 *871859 *895378 *913858 
703 *971578 “968570 "975529 "978221 
To4 *969132 *973037 ‘968885 ‘977710 
T9 *825960 "854617 "860826 *883242 
13 *991230 “991070 995812 "995626 
14 ‘983883 "993559 “988210 *994336 
193 *889151 ‘909771 *890463 “911441 
oq "909835 "902419 923618 *921165 
37°01 +°119558 + °107783 — 186994 — °221147 
401 + °301348 — *125303 +:271614 + '161865 
3102 + °037627 — 090243 + *266908 | + :260740 
403 — '134827 — 062670 + 005249 — 308612 
3112 — ‘916019 — 849620 — 622720 — 629876 
alo = *932723 — ‘860001 — *884404 — °790688 

Also for the 1881 male cohort : 
te = 999225, M%s= ‘965417, 1%,= 866632, 


wus = + 987800, 
ws = — 013340, 
ste = — 944209. 


T~—= 967678, 
sos = — ‘131819, 


ys = +'295690, 
ost’5 = + 008833, 


ON ERRORS OF RANDOM SAMPLING IN CERTAIN 
CASES NOT SUITABLE FOR THE APPLICATION OF 
A “NORMAL” CURVE OF FREQUENCY. 


By M, GREENWOOD, Junr, 
(From the Statistical Laboratory of the Lister Institute of Preventive Medicine.) 


I. 

Introduction. 

THOSE who believe that the more closely a branch of knowledge adapts itself 
to the principles of quantitative reasoning, the more justly it merits to be ranked 
as a science, have been gratified by the improved standard adopted in the treat- 
ment of medico-statistical results. It is true that even now medico-statistical 
writers fall short of the attainments regarded as essential in some other depart- 
ments of natural knowledge, and that a few prominent investigators vaguely 
denounce “ mathematicians ”—by which term is to be understood any one trained 
to employ modern biometric methods—as presumptuous intruders within the 
sphere of experimental medicine. Despite these obstacles, progress has been 
marked within recent years and we may have considerable confidence that future 
discussions as to the value of such procedures as vaccination or the determination 
of an opsonic index will be conducted with due regard to the claims of exact 
science. 

It is, however, in the nature of things that a reform of this magnitude should 
be accompanied by certain disadvantages which tend to impede the march of 
ideas. For instance, reformers may urge that the employment of certain argu- 
ments requires for logical validity the application of some specific test. After 
much discussion, the point is conceded and then the test is in danger of being 
applied in other and unsuitable instances. 


The particular illustration which has prompted these remarks is the employ- 
ment of some consequences of the current theory of errors of random sampling in 
certain cases which frequently arise in medical and pathological work. At one 
time it was customary to base conclusions as to the efficacy of some method of 
treatment upon short series of cases without any statistical test being employed. 
A practitioner might find, for example, that of 100 cases of typhoid fever treated 
without any special precautions as to diet, six had died. Of a subsequent 100, 
dieted in a particular way, but two succumbed and a conclusion very favourable 
to the new method of treatment might be ventured. Owing to the partial 
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permeation of medicine by quantitative methods, it is widely felt that this proce- 
dure is questionable and that the “probable error” of the result must be found. 
The medical writer who has attained this level accordingly refers to a text-book 
and tests his proportions upon the basis of a “normal” curve of errors with the 
binomial standard deviation J npg. 


The specialist in mathematical statistics is aware that the time-honoured 
theory of the “probable error” rests upon certain assumptions of a quite definite 
character not adequately fulfilled in the imaginary case described. Warnings as 
to this are given in the better text-books, and are indeed unnecessary for those 
who care to read the proofs of the usual formulae. 


We must, however, bear in mind that not every medical man has either the 
time or the training requisite for the comprehension of mathematical analysis and 
many will be inclined to consult a book which, while giving formulae without 
proofs, contains explicit instructions as to their practical employment, Such a 
book as, for instance, Professor Davenport’s Statistical Methods, seems admirably 
adapted to the needs of the laboratory worker. On p. 14 (2nd edition) he will 
find the following sentence :—‘ The probable error of the determination of any 
value gives the measure of unreliability of the determination; and it should 
always be found.” The statement is commendably clear but, unfortunately, quite 
incorrect in many cases which come under the notice of the medical inquirer. 


The present memoir is an attempt to make the limitations of the process 
recommended by Professor Davenport arithmetically obvious to the medical 
reader, and to provide the latter with some assistance in the exceptional cases, 
To the trained mathematician or biometrician I have nothing to offer which is 
novel and little which is of interest, while the medical reader may find some 
difficulties in following every step of the inquiry. I hope these difficulties have 
been reduced to a minimum, but a risk of falling between two stools has to be 
faced by any writer dealing with a subject not new in itself but relatively so in its 
applications. My biometric colleagues will recognise the difficulties of the task, 
and are alone competent to determine the measure of success or failure achieved. 


IT. 


The chance of an event happening is p and of tt failing, (p+ G=1). What rs 
the “probable error” of pin successes in m trials ? 


In the problem stated the probability for the occurrence of an event and the 
independence of the happenings are supposed to be known. This ether means 
that they have been ascertained by long experience or that their values (i.e. the 
value of p and the zero correlation between the results of successive trials) are 
defined by an hypothesis which we desire to verify. Accordingly the distribution 
of successes in m trials is given by the expansion of the binomial (p + q)™ If m 
be moderately large and px gq small, the ordinates of a “normal” curve with 


Standard Deviation Wmpq are a close approximation to the terms of the binomial 
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and the “probable error” of mp is + 67449 V mpq.- This is the classical text-book 
case. Its limitations are obvious. If either p or ¢ be very small unless m is very 
large indeed, and for all values of p and G when m is very small, the normal curve 
does not approximate closely to the binomial. 

Consider this problem. A certain bacillus is stated to occur in the mouths of 
2 per cent. of all normal persons. Twenty persons have been examined and _ the 
bacillus was isolated from two of them. Is this observation consistent with the 
truth of the hypothesis ? 

Let us find the chance that in 20 trials two or more successes would be met 
with if p= ‘02, g='98. 

By direct calculation we find this chance to be about 1 in 17. If we use a 
“normal” curve with Standard Deviation ,/20 (02 x ‘98), the chance proves to 
be rather less than 1 in 25, or the probability determined in this way is only 
68 per cent. of the real value. Of course when the number of trials is so 
small we could not expect a continuous function effectively to represent the 
binomial expansion, but even for m large the inadequacy of the “ normal” curve, 
in the case of P~@ not small, must be insisted upon. I think the best way of 
making this clear arithmetically is from a consideration of the moment coefficients 
of the binomial (p + q)”. 

With the ordinary notation we have :— 

fb, = OMDpq, 
Ms = cm pg (Pp — q), 
fy = mpg {1 +3 (m— 2) pat, 
and A=", By = Fs, 
Be be 
In cases like the present, c may be taken as unity. 

For a “normal” curve to be a good fit to the binomial, 8; should be very small 

and 8, nearly equal to 3. 


Take as an illustration the values of 8, and £, for different values of m where 
p= 02, g='98. 


We obtain: 
m By | Bey 
| 

100 ‘4702 | 3°4502 
200 api! eV 
300 “1567 | 3° 1501 
400 ‘1176 3°1126 
500 ‘0940 3°0900 
600 ‘0784 3°0750 
YOO “0672 3°0643 
800 0588 30563 
900 "0522 3°0500 

1000 ‘0470 3°0450 
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From these figures it is plain that even comparatively large values of m do not 
admit of the binomial being closely approximated to by a “normal” curve. 

Since, however, direct evaluation of the terms of the binomial is very tedious 
when m is at all large, we need a curve which bears the same relation to the skew 
binomial that the normal curve does to the symmetrical binomial. Such a function 
was provided years ago by Pearson*, viz. his Skew Curve of Type III, 


¥ = ye (1 oF mY ’ 


4 
where s= ea = 1, 
eae m 
Y “3 = ji (taking the unit of measurement c= 1 as before), 
eee ae 
SG Ti 


To use this curve with the rapidity possible in the case of a “normal” curve, 
we need tables not at present published. 

In any particular case, however, the curve may be calculated and the area 
between assigned ordinates approximated to with little labour. 

To sum up, we have the following rules for practical work when j is known or 
assumed. 


(1) When m is small, say less than 25, the binomial expansion should be 
directly evaluated. 


(2) When m is moderately large and p or ¢ not small, say not less than ‘1, 
the ordinary method based on the “normal” curve can be trusted. 


(3) IZf m is moderately large and f or g less than ‘1, a skew curve of Type III 
should be fitted from the momental constants of the binomial and the areas 
between assigned ordinates estimated with the help of quadrature formulae. 


III. 


If in n trials an event happened p times and failed q times, what is the probable 
distribution of successes and failures in m subsequent trials and what are the 
respective chances of 0, 1, 2, ... m successes in m trials, rt being assumed that the 
occurrences are independent and that the “universe” of events is indefinitely greater 
than n +m? 

This problem is of fundamental importance. We note at once that when the 
last condition is imperfectly fulfilled an important special case may arise, for we 
then have :— 

n+m,. : 7 es : x 

Save finite where V is the number of events comprising the “universe 


* For a recent précis of the relevant facts, see Pearson, K. ‘‘On the Curves which are most suitable 
for describing the Frequency of Random Samples of a Population,” Biometrika, 1906, Vol. v. p. 172. 
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or “population” from which the samples come. This problem of drawing from 
a “limited universe” will not be considered in the present memoir; it has been 
discussed in the paper of Pearson last cited *. 

_ The class of problem to which attention is now directed may be typified as 
follows :— 

Fifteen “control” rats have been inoculated with a constant dose of a standard 
culture of plague bacilli and twelve succumbed in a certain time. Ten similar rats 
have been immunised by a method it is desired to test and five of these died after 
inoculation with a dose of culture similar to that employed upon the “ controls.” 
What is the probability that the deviation from the rate of mortality obtaining 
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among the “controls” is a chance event ? 


Evidently the methods of pp. 69—72 cannot be used. To state that the a priori 
chance of dying is ‘8 is to ignore the fact that the size of the “control” sample 
does not justify us in assuming that its proportional yield approximates at all 
closely to that of the whole population. 

Let us, then, start from first principles, merely assuming (an assumption based 
on or supported by the fairly wide practical experience of civilised humanity) that 
all possible events are, in the absence of any grounds for inference, equally likely 
(Bayes’ principle). 

On this assumption, we have, by Bayes’ Theorem for the chance P, that the 
true probability of an event, observed to happen p and fail g times in 1 trials, is 


between w and «+ 6a: 
a? (1 —a)idax 


i 


i P 
| a? (1—«)! da 
( 


) 


P, 


.A second trial of m being made, the total chance of its yielding r successes 
and m—r failures is: 

1 
| eae (al as a)jatm—r da 
J0 ; 
Le: Ae ee Oe ee eee (1). 
ri(m—r)! i 

( ) | a? (1—«a«)tda 

{ 


~ 0 


m! 


This is, in modern notation, the result contained in the 7th of Condorcet’s 
problems published in his Lssai, 1785+, but Laplace had, eleven years previously, 
m! 
ri(m—r)! 
standard model of an urn from which balls are drawn, he assumed the drawings to 

have been made in an assigned order). 


given the theorem with the omission of the term (1.e. working on the 


To Pearson *, whose symbols will be used, belongs the credit of emphasizing the 
enormous statistical value of the theorem. The usual method of treating (1) has 

* See Pearson, op. cit. pp. 173—5. 

+ See Todhunter’s History of the Theory of Probability, p. 383, and for a similar result obtained by 
a different process in 1795 by Prevost and Lhuilier, op. cit. p. 453. 

~ Karl Pearson, ‘‘On the Influence of Past Experience on Future Expectation,” Philosophical 
Magazine, 1907, p. 365. 
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been to show that, under certain conditions, the probabilities of different values of 
r can be represented by the ordinates of a “normal” curve*, The nature of the 
assumptions involved will be placed in the clearest light by the following 
considerations. 

Substituting 0, 1, 2, ... m for r successively in (1), reducing to B and then to 
I functions and finally evaluating each term, we have for the chances of 0, 1, 2,...m 
successes in a sample of m after a first sample n=p+q: 
Co cpa TD ie as =)... : ae (2), 

i Tig+m 2) amg 4-7 — 1) 
D(qtm+1)U (n+ 2) 
IG FG) 

We may notice that, if p and gq are both very large as compared with m, 
(2) reduces to 


(4) (1 nes ae (m — 1) p? ed ete.) 


where C= 


n ql! 2!q? 
= (2) (1 Se a =(p+q)™", where p =f and=¢ = 4 Beara (2) bes. 


The conditions for the approximation of this binomial to the “normal” curve 
have already been noted. 

More directly, the approach of (2) to a “normal” form can be examined by 
treating the series in brackets, which is a hypergeometric series having as 
parameters 

a=—m, B=pt+l1, y=-(qt+m), 6=1, 
by the method of moments and then noting the conditions under which the 
momental constants 8, and ®, approximate to the values 0 and 3 respectively. 
This method was adopted by Pearson who had, several years before the date of 
the publication last cited, obtained the moment coefficients of a hypergeometric 
series t. 
The results are that: 


(m— 1)? 
pie a aadces)) Uhpote a———e (3), 


~ m(p+i1)(qt+1) prtaes 


m m—Il1 
Te 3 
m— I m—2 
Geo a aera (ees) a 
m(p+1)(q+1) m1. ae ; 


* See, for instance, Ozuber’s Wahrscheinlichkeitsrechnung (1903 Edition), pp. 151 ete. 
+ Karl Pearson, ‘‘On Certain Properties of the Hypergeometrical Series, and on the fitting of such 
Series to Observation Polygons in the Theory of Chance,” Philosophical Magazine, 1899, p. 236. 
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If we write p =f, q =, a and if m and n are both absolutely large, 


we find for (3) and (4) 


a 


m)\? 
eee ”) 
(¢— py ( n 


Spat aes i A SE Le ee oo), 
Om (p+OG-) ,,™ ae 
n 
Mm ML 
Pees r Meteo) aus 
2 m (p+ 5) G ma é) ie nn ee \ 
n 

If now m be small relatively to n, 

B Cia De and B,=3+ ee ee (5). 


" m(p +e) (G—e) 

If n be small relatively to m, 

4 (7 —py? 6 

A= - = i B a nd ~.=3+ apra@se (6). 
After exhibiting these results, Pearson remarks*: “ Both forms result—for 7 or 
m large and the product of either with p and g not small—in 8, =0 and 8, =3, 
ie. in the symmetry and mesokurtosis, which are for practical purposes closely 
enough represented by the Gaussian curve. But if m and n be commensurable, 
and either p or g moderately small, this result by no means follows.” 

It is accordingly plain that in all cases of m and n both small the use of 


m (p +e) (q—e) 


a “normal” curve with s.D.=/mpq is inappropriate. When p= the condition 
of mesokurtosis is fulfilled and the divergence from “normality” reduces itself 
to the difference between the Gaussian and Pearson Type II curves, The 
accompanying table illustrates this in a particular example. 


A Second Sample of 10, after a first Sample of 100; p=q=50. 


Comparison of Series with Curves (Totals = 100). 


| | 
hes Hypergeometric | Normal Curve Normal Curve ane Type 1 es 
Successes yp Seives | j— = 93412 (1 x 10°22177 
erles | 8. D.Vnpq S. D. V(n+1) pq i— 260 2( - a5) 
| 
| | 
0 146 O71 opis 188 
1 1°243 1122 1°408 1-331 
2 4°931 4°349 4°843 5‘091 
8 12°017 11°447 1702 12°107 
4 19°922 20°452 | 19°728 19°729 
5 23-480 24°817 | 23972 23°105 
6 19°922 90°452 | 19°728 19°729 | 
7 12°017 11°447 | 11°702 12°107 | 
8 4°931 4°349 4°843 5091 
2) E243 1°122 | 1°408 1°331 | 
10 146 22) 333 188 | 
| 


* Op. cit. (1907), pp. 371—2. 
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A few arithmetical results may now be given. 


Let n=100 and m=50. 


Area * of ‘*‘ Normal’’ 


From Series (2) Curve, with §. D. Vin PG 
p=4,. 7=6 
Chance of 20—22 Successes 255+ ‘271 
p='l, 7=9 
Chance of 5—7 Successes 3193 4739 
a O—2 3 1311 1145 
=O, G9) 
Chance of 0O—2 Successes 8938 9202 
Ri 3—5 PA ‘1007 0022 


We see how the liability to error increases with p ~ @. 


An interesting special case may be discussed here which emphasizes the 
importance of the problem indicated. 


Suppose the first sample has given all successes or all failures, so that p or 


g=0, how are we to measure its reliability ? 


Many unsophisticated users of formulae must have been puzzled by this case, 
since, construing the formulae aw pied de la lettre, it would appear that after 
n successes in n trials, we ought to get m successes in m with a probable error 


of 0} 
The paradox vanishes if we consider (2). Put in it n =p and we have 


a aren Was ORSP a), 


(n+m-+1)! 1! 2! 


From this we see that the ratio of the (m-+1)th term to the whole sum (i.e. the 
: : 1 ; 
chance of m successes in m trials) is eee ; from which we conclude: 
n+m+1 
(a) Only when n is very large as compared with m does the chance of 
obtaining 100 °/, successes in m trials approach unity. 


(b) In particular if =m and both are large, the chance is about °5. 


For instance if we have had 100 °/, successes in 200 trials the chance of getting 
the same proportion in a subsequent 50 is about 4 to 1. If, on the other hand, 
n = 50 and m = 200 it is 1 to 4. 


In view of what follows it may be worth noticing that a closed expression for 
the sum of any number of terms of (7) can readily be given. 


* Taking for area corresponding to x successes, the area between the ordinates x —*5 and «+°5. 
+ Approximate only, obtained by using Stirling’s theorem in the expression 


m!(p+r—-l1)i(qtm—r+ lt 
(m—r+ 1)lp!(q+m)l(r-1)!? 


to find the rth term of series (2). 
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Write in Euler’s identity * 
1 — a + a, (1 — ay) + GQ (1 — Gg) + 0. + Oy... On (1 — ngs) = 1 — GOs ... Anyi, 


x “+p 
Ay = D1 ote, 


OS ea M2 
¥ Gare pi 
Multiply by 2 and subsequently put y = 0. 
; 2 ae (a+! at) / 2 WD te 10), 
eee: te 4 DPD SS eS " (“Pe oe (8). 
Pi Pips Pr Pe Pn 
Putting MEl=2 p= 1, p= 2). Pm =m 
] ») y=m 
ee ia 1 (at DiGi) ee a (1 nee . *) Core (9). 
ms y=1 


Reverting to the general case, we note that for testing the divergence between 
first and second samples the formula (2) must always be employed when m and n 
are commensurable and p~@ not small. This rule certainly applies to all cases of 
m and n less than 300 or 400 and p (or q) <1. If m and n be large the best 
plan will be to fit to (2) the curve indicated by the momental constants, using its 
proportional areas (obtained by some convenient quadrature formula) precisely in 
the manner adopted with the tabled areas of the “normal” curve. 


Such a method is, however, not convenient for laboratory workers nor specially 
appropriate when m is a small number, since in that case the terms of the dis- 
continuous series are not closely represented by a continuous curve. 


Evidently what one needs is a tabulation of the series (2) for different values of 
m, n and p. 


Were it possible to obtain a simple formula for the sum of any assigned 
number of terms of (2), the computation of such a table would be a rapid process. 


In the particular case p = 0 or n, such a formula has been given above. In the 
general case I have not reached any result} and more widely trained mathematicians, 
who have kindly allowed me to consult them, do not regard the problem as a 
simple one. 


I therefore fell back upon the method of direct calculation. This is a straight- 
forward but irksome task ft. 


* See Chrystal’s Algebra, Vol. 11. p. 392, Ed. 1889. 

+ Formulae are available in certain types of Hypergeometric Series. Vide M. J. M. Hill, “On a 
Formula for the Sum of a Finite Number of Terms of the Hypergeometric Series when the Fourth 
Element is equal to Unity,” Proc. Lond. Math. Soc. 1907, Series 2, Vol. v. p. 335; and 1908, Series 2, 
Vol. vi. p. 339. The methods of these papers cannot be used in the present case. 

+ Sir Ronald Ross and Mr W. Stott have recently published (‘‘ Tables of Statistical Krror,” Annals 
of Tropical Medicine and Parasitology, Vol. v. No. 3, 1911) a set of tables for the use of laboratory 
workers, Their tables will be of great service in the cases in which p is not less say than 0-1, but are 
not, I think, available for the class of problem discussed in this paper, since they appear to be based 
on the “normal” theory of errors. It must be noticed that in an immense number of examples which 
arise in medical work p will not only be less than 0:1 but less than 0:01 (the prevalence of mental defect 
in children, albinism, epilepsy, ete. are instances), and for such cases the ‘‘normal” treatment is, as 
pointed out above, inappropriate and often misleading. 
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For the benefit of those who wish to extend my small tables, it may be worth 
indicating the arithmetical arrangement which I have found most convenient. 
I use the following scheme: 


n,m a 
esis ie C, 
q+m 100 CG, 
Term Multiplier ; 
Tape) 
: (a) (¢+m)1 2) 
nepsad) (cre Dei) 
Gam Ginone | 
a ; (m — 2)(p +8) 
Bb (y) (q+ m—2)3 (c) 
ye (6) (d) 


ml ba 1 
The values of n, m, p,q and q+m are written at the top of the sheet, — and 
0 


100 C, are caleulated and written in the right-hand top corner. 


Two columns are next formed; the entries in the right-hand column having 
been made, any given term of the left-hand column is the product of the entries 
in the columns immediately above it. The entries in the left-hand column are 


added up and the sum checked by comparing it with a Finally each term is 
‘0 


converted into a percentage by multiplying with 100 (,. 


In this way, provided one has a mechanical calculator, a series having only 
a moderately large number of arithmetically significant terms is rapidly evaluated. 
Still, when all is said, the calculation of a table for values of m and n ranging from 
say five to a hundred and p from 0 to ‘1 would need an heroic amount of patience. 
Even the present admittedly imperfect results have involved the expenditure of 
some little time and effort* and it was necessary to consider how best to utilise 
our limited resources. 


Having chiefly before my eyes the needs of laboratory workers, I felt sure that 
the cases of m and n not greater than 25 were of the most importance. Probably 
in the type of problem alluded to on p. 73 the “control” should be regarded as 
our n and it is usually possible to arrange the experiments in such a fashion that 
animals at least equal in number to those specially tested serve as the control. 
When it is possible to plan a large control, it is usually practicable to fix the 


* T desire heartily to thank my assistant, Mr J. W. Brown of the Lister Institute Statistical 
Department, to whose zealous co-operation in the arithmetical work I am greatly indebted. 
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number arbitrarily, so that it seemed sufficient to give in tabular form the results 
of small samples after first samples of 50 and 100 without calculating the inter- 
mediate cases. 


The next question is as to whether, within the limited field chosen, interpolation 
can be trusted. Accurate methods of interpolation in the case of double-entry 
tables are a little complex* and not likely to appeal to the man in the laboratory. 
What is really material is whether simple interpolation is likely to lead to seriously 
erroneous conclusions. 


I now proceed to some tests. 


(1) A first sample of 17 having given 3 successes, required the probability 
that a second sample of 14 will contain 4 or more successes, 


From the tables for n= 20, p=3, m=10 and n=15, p=3, m=10, we have 
for the proportional frequency of 4 or more successes in m trials, 


12°8062 
23°0040 
10°1978 
which gives by simple interpolation for n=17, p=3, m=10, 
18°92488 (a). 


Similarly interpolating between the values for n = 20, p=3, m=15 and n=15, 
p=3, m=15, we have for n=17, p=3, m=15, 


3892372 (8). 


Interpolating between («) and (8) we reach for the proportional frequency of 
4 or more successes in 14 trials after n= 17, p=3, 3492. 


The true value obtained by direct calculation is 35:07601, which gives an error 
of “43 °/, in the interpolated value, a difference of no importance for such 
purposes as the present. In the accompanying table I have grouped together the 
results of a number of random trials made in different parts of the table. 
A perusal of these results leads, I think, to the following conclusions. 


(1) For values of m and n ranging in each case up to 25, interpolation, when 
necessary, gives results of sufficient exactitude for all the purposes likely to be 
served by such tables. 


(2) For greater values of m and n, particularly when the latter is greater than 
50, the differences are too great to allow of interpolation and for such values the 
table can only provide the reader with a general impression (which is often enough 
sufficient) as to the limits within which possible variations from the proportions 


* Vide W. Palin Elderton, ‘‘ Interpolation by Finite Differences (Two Independent Variables),” 
Biometrika, 1903, Vol. 11. p. 105; W. Palin Elderton, ‘“‘Some Notes on Interpolation in n-dimension 
Space,” ibid. 1908, Vol. vr. p. 943; also the Introduction to the British Association Tables of I(r, v) and 
H (r, v) Functions (issued by B. A. 1899, p. 56). 
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found in the first sample are likely to fall. As remarked above, it was thought 
that when the first sample exceeded fifty its numerical composition would often be 
at the choice of the worker. I think, therefore, that these tables are likely to 
serve most of the objects I had in mind when the work was undertaken, although 
it is much to be desired that someone will have leisure considerably to extend 
them. I do not see any immediate prospect of being that person. 


The class of problem in which this species of investigation seems desirable has 
already been described and the reader is perhaps not anxious to see any more 
arithmetical examples. I may, however, give a single concrete instance of the 
kind of research in which, I hope, the tables will be of value. 

Tests of the Accuracy of Simple Interpolation. 
[The True Values are given in brackets, | 


Example I. -»=22, p=4, m=16. 


0—3 Successes 1—3 Successes 3 Successes 
57°38 (57°49) 51°27 (51°65) 18°70 (19°14) 
Example II]. n=37, p=4, m=22. 
0—8 Successes 1—2 Successes 3 Successes 
62°85 (67°89) 36°47 (39°83) 16°64 (18°88) 
Example Ill. n=71, p=4, m=43. 
0—2 Successes 1—3 Successes 3 Successes 
46°43 (47°53) 50°60 (56°66) 15°21 (18°25) 
Example IV. n=100, p=4, m=39. 
0—3 Successes 1—2 Successes 3 Successes 
83°84 (84:93) 48°59 (50°78) 13°78 (15:15) 


In a paper by Rous*, several experiments of the following kind are detailed. 


15 mice+ were injected intraperitoneally with a suspension of mouse embryo in 
normal saline and 11 days later reinjected with the same substance. Ten days 
after the second injection they were inoculated subcutaneously with a mass made 
from mouse embryos 1'5 cm. long, and 17 previously untreated mice were inoculated 
at the same time to serve as a control. 


In only one of the 17 control mice was no graft found at the autopsy, but 8 of 
the treated mice did not “take.” If we wish to know whether this difference 
be an effect of the intraperitoneal inoculations, we may put n=17, p=1, and 
ascertain the chance that in m=15 there would be 8 or more successes. 


From the tables, with interpolation, I find the odds against such a result to be 
about 260 to 1. In other words it is very likely that the treatment has led to the 


* « An Experimental Comparison of Transplanted Tumour and a Transplanted Normal Tissue 
Capable of Growth,” by Peyton Rous, M.D., Journ. Experimental Medicine, 1910, Vol. x11. p. 344. 
+ I take the number stated in the text but can only identify 14 in the corresponding table. 
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observed result. It may be remarked, however, that had we used a normal curve 
; - 1x16 : 

with 8.D. / 18 iy’ the odds estimated therefrom would have been enormously 

greater. 

In conclusion, I desire to refer to a subject indirectly related to the topic of this 
paper and, I think, of importance. We are familiar with such arrangements of 
material as the following. mn, persons have been immunised against a certain 
disease and, having contracted the disease, a, have died. Of n, not immunised 
a, have died (2< a). The extent of protection conferred is then estimated by 

1 Pp) 
some coefficient of correlation or association. The trustworthiness of the coefficient 
so calculated is then measured by a comparison between its arithmetical value and 
that of its standard deviation or “ probable error.” This process has its limitations. 
If, for instance, either a, or a, be zero, Yule’s coefficients of association and colli- 
gation become unity and their standard deviations are indeterminate. 


I would put forward for consideration the possibility that the use of Bayes’ 
theorem might here be of value. Thus: 


Let the chance of a, or more successes in n, after a, successes in n, be p, and 
the chance of a, or less successes in 7, trials after a, successes in n, trials be p,. 
Then, since either n, or n, might have been drawn first, a measure of the probability 


of the observed result will be pe 


We might indeed adopt a scale of reliability by putting P=/ (2 xe), the 
Pit Po 


5 diminishes to zero. 


function being such that P increases to unity as 


I put forward these suggestions with some doubt, but I cannot help feeling 
sure that in such cases as those I have instanced the ordinary method of testing 
the reliability of a coefficient of association is a dangerous and possibly misleading 
artifice*. 

In conclusion I desire to express my regret that this paper is so imperfect. 
The problems treated require mathematical abilities and training not at my 
command. I have only ventured to write upon the subject because of its practical 
importance and may, perhaps, venture to entertain the hope that my numerous 
mistakes of omission and commission will be leniently treated. 


* [The probability corresponding to the x? of the fourfold table can be calculated straight away ; but 
the difficulty arises from our not mentally appreciating grades of probability with the readiness we 
appreciate grades of correlation on a limited scale. Eprror.] 
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TABLE. Percentage Frequency of Successes in a Second Sample 


Successes 
n=6) O 
m=5\ 1 
Q 
3 
4 
oO 
n=6| O 
m=6( 1 
ag 
3 
4 
5 
6 
(P= TI! 
Mi —oi\ el 
2 
38 
4 
a) 
n=7T| O 
m=6{ 1 
2: 
3 
4 
5 
6 
n=7| O 
m=T{ 1 
Q 
3 
4 
3: 
6 
7 
n=8| O 
m=5f 1 
D 
8 
4 
5 
n=8| O 
m=6{ 1 
Z, 
4 
o 
6 
n=8| 0 
m=7{ 1 
Q 
3 
4 
‘55 
6 
7 


On Errors of Random Sampling 


drawing “p” Successes in a First Sample “n”. 


p=0 
58°3333 
26°5151 
10°6060 
3°5354 
*8838 
*12638 


53°8462 
26°9231 
12°2378 
4°8951 
1°6317 
4079 
0582 


61°5385 
25°6410 
9°3240 
2°7972 
6216 
‘O777 


57°1429 
26°3736 
10°9890 
3°9960 
1°1988 
"2664 
0333 


53°3333 
26°6667 
12°3077 

5°1282 


60-0000 
25°7143 
9°8901 
3°2967 
8991 
1798 
“0200 


56°2500 
26°2500 
11°2500 
43269 
1°4423 
3934 
‘0787 
‘0087 


p=l1 
31°8182 
31°8182 
21°2121 
10°6060 
3°7879 
‘7576 


26°9231 
29°3706 
22°0280 
13°0536 
6°1189 
20979 
“4079 


35°8974 
326340 
19°5804 
8°7024 
2°7195 
4662 


30°7692 
30°7692 
20°9790 
11°1888 
4°6620 
1°3986 
2331 


26°6667 
28°7179 
21°5385 
130536 
65268 
2°6107 
‘7615 
1243 


39°5604 
329670 
17°9820 
7°1928 
1°9980 
2997 


34°2857 
31°6484 
19°7802 
95904 
3°5964 
"9590 
1399 


30°0000 
30°0000 
20°7692 
11°5385 
5°2448 
1°8881 
“4895 
‘0699 


p=2 
15°9091 
26°5 151 
26°5151 
189394 

9°4697 

2°6515 


12°2378 
22°0280 
24°4755 
20°3962 
13°1119 

6°1189 

1°6317 


19°5804 
29°3706 
26°1072 
16°3170 
6°9930 
1°6317 


15°3846 
25°1748 
25°1748 
18°6480 
10°4895 
4°1958 
9324 


12°3077 
21°5385 
23°4965 
19°5804 
13°0536 
6°8531 
2°6107 
5594 


230769 
31°4685 
25°1748 
13°9860 
5°2448 
1°0489 


18°4615 
276923 
25°1748 
16°7832 
8°3916 
2°9370 
5594 


15-0000 
24°2308 
24°2308 
18°3566 
11:0140 
5°1399 
1°7132 
3147 


p=3 

70707 
17°6768 
25-2525 
252525 
17°6768 

7°0707 


4°8951 
13°0536 
20°3963 
23°3100 
20°3963 
13-0536 

4°8951 


9°7902 
21°7560 
27°1950 
23°3100 
13°5975 
4°3512 


6:9930 
16°7832 
23°3100 
23°3100 
17:4825 

9°3240 

2°7972 


5°1282 
13°0536 
19°5804 
21°7560 
19°0365 
13°0536 
6°5268 
1°8648 


12°5874 
25°1748 
279720 
20°9790 
10°4895 

2°7972 


9°2308 
20°1398 
25°1748 
22°3776 
14°6853 
6°7133 
1°6783 


6°9231 
16°1538 
220280 
220280 
17°1329 
10°2797 
4°4056 
1:0489 


” 


p=4 


6°2937 
17°4825 
26°2238 
26°2238 
17°4825 
6°2937 


4°1958 
12°5874 
20°9790 
24°4755 
20°9790 
12°5874 

4°1958 


2°8846 
9°1783 
16°5210 
21°4161 
21°4161 
16°5210 
9°1783 
28846 


“ 


m” after 


p=s 


Successes 


n=8) O 
m=8; 1 


3 
I tl 
or co 
—_—C 
MA COWH DS HWA NA bw 


3 
Il il 
© 
= 


me 


m=7 


WHNRAAS MHD NAA’ MORO AAR Co MRS 


n=9 
heat 


1S CONA AW Oo WHO 


Tw Ss WHO 


p=0 
52°9412 
26°4706 
12°3529 
5:2941 
20362 
‘6787 
“1851 
0370 
0041 


66°6667 
23°8095 
7°3260 
1°8315 
3330 
0333 


62°5000 
25 0000 
89286 
2°7472 
6868 
*1249 
0125 


58°8235 
25°7353 
10°2941 
3°6765 
1°1312 
2828 
0514 
0051 


55°5555 
26°1438 
11°4879 
4°5752 
1°6340 
5027 
1257 
0229 
0023 


52°6316 
26°3158 
12°3839 
5°4180 
2°1672 
‘7740 
"2381 
0595 
0108 
‘0011 


54°5454 
27°2727 
12°1212 
4°5454 
1°2987 
2165 
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TABLE—(continued). 


p=1 
26-4706 
28°2353 
21°1765 
13°0317 
6°7873 
2°9617 
1-0366 
"2633 
‘0370 


42°8571 
329670 
16°4835 
59940 
1°4985 
"1998 


37°5000 
32°1429 
18°5439 
82418 
28097 
67438 
0874 


330882 
30°8824 
19°8529 
10°1810 
4°2421 
1°3883 
3239 
0411 


29°4118 
29°4118 
20°5882 
11°7647 
5°6561 
2°2624 
*7199 
1645 
0206 


26°3158 
27 *8638 
20°8978 
13°0031 
6:9659 
3°2151 
1°2503 
3897 
0877 
“0108 


272727 
30°3030 
22°7273 
129870 
5°4112 
1-°2987 


9 


12°3529 
21°1765 
22°8054 
19-0045 
12°9576 
7°2563 
3°2250 
1:0366 
1851 


26°3736 
329670 
239760 
11-9880 
3°9960 
6993, 


21°4286 
29°6703 
24°7253 
14°9850 
6°7433 
20979 
“3496 


17°6471 
26°4706 
24°4344 
16°9683 
9°2554 
3°8873 
1°1518 
1851 


14°7059 
235294 
23°5294 
18-0995 
11°3122 
5°7589 
2°3036 
6582 
"1028 


12°3839 
20°8978 
22°2910 
18°5759 
12°8602 
75018 
3°6372 
1°4029 
3897 
"0595 


12°1212 
22°7273 
25°9740 
21°6450 
12°9870 

4°5454 


p=3 
52941 
130317 
19-0045 
20°7322 
18°1407 
129000 
72563 
29617 
‘6787 


15°3846 
279720 
27°9720 
18°6480 
8°1585 
1°8648 


11°5385 
23°0769 
26°2238 
20°9790 
12°2378 

4°8951 

1:0489 


8°8235 
19-0045 
23°7557 
21°5961 
15°1172 
8:0625 
3°0234 
‘6170 


6°8627 
15°6863 
21°1161 
21°1161 
16°7969 
10°7500 
5°3750 
1°9197 
3771 


5°4180 
13:0031 
18°5759 
20°0047 
17°5042 
12°7303 
76382 
3°6372 
1°2503 
2381 


4°5454 
12°9870 
21°6450 
25°9740 
22°7273 
12-1212 


p=4 
2°0362 
6°7873 
12°9576 
18°1407 
20°1563 
18°1407 
12°9576 
6°7873 
2°0362 


8°3916 
20°9790 
279720 
24°4755 
13°9860 
4°1958 


5°7692 
15°7343 
23°6014 
24°4755 
18°3566 
9°4405 
2°6224 


40724 
11°8778 
19°4364 
226759 
20°1563 
136055 

6°4788 

1°6968 


2°9412 
9°0498 
15°8371 
20°1563 
20°1563 
16°1250 
10°0782 
4°5249 
1°1312 


2°1672 
69659 
12-8602 
17°5042 
190955 
17°1859 
12°7303 
775018 
3°2150 
“7740 


p=5 


11-2 


83 
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Percentage Frequencies of Successes in a Second Sample “m”. 


Successes 
n=10| 0 
m= 5{ 1 
2 
3 
4 
5 
n=10| 0 
m=10{ 1 
Q 
3 
4 
5 
6 
7 
8 
9 
10 
n=15| O 
m= 5{ 1 
a 
y 
5) 
n=15| 0 
m=10{ 1 
4 
5 
6 
ef 
8 
9 
10 
n=15\| O 
m=15{ 1 
3 
B 
5 
6 
di 
8 
9 
10 
11 
12 
13 
14 


p=0 


68°7500 
22°9167 
6°5476 
1°5110 
"2518 
"0229 


52°3809 
26°1905 
12°4060 
5138 
"2704 
"8514 
*2838 
‘0811 
‘0187 
0031 
“0003 


bo Or 


76°1905 
19:0476 
4:0100 
6683 
0786 
0049 


61°5384 
24°6154 
9°2308 
3°2107 
1:0216 
*2919 
‘0730 
0153 
0025 
“0003 
“0000 


6129 
“8065 
4583 
“7842 
5707 
0876 
4351 
1631 
‘0567 
0181 
“0052 
‘0013 
“0003 
“0001 
“0000 
“0000 


me! bo Or 


mm bo OUD OU 


joi 


45°8333 
32°7381 
15°1099 
5°0366 
1°1447 
1373 


26°1905 
27°5689 
20°6767 
12:9736 
70949 
3°4056 
1:4190 
“4990 
"1408 
0284 
0031 


57°1429 
30°0752 
10°0251 
2°3588 
*3685 
0295 


36°9231 
30°7692 
18-0602 
8°7565 
3°6485 
1°3135 
4032 
“1024 
0203 
‘0028 
“0002 


25 8065 
266963 
200222 
12°8538 
7°4156 
3°9155 
1:9033 
8512 
3482 
*1289 
0426 
0122 
0029 
“0006 
‘0001 
“0000 


p=2 


29°4643 
33°9972 
226648 
10°3022 
3°0907 
“4807 


124060 
20°6767 
2178930 
18°2441 
12°7709 
76625 
3°9295 
1°6841 
‘5741 
“1403 
‘0187 


421053 
35°0877 
16°5119 
5°1599 
1°0320 
"1032 


21°5385 
28°0936 
22-9857 
14°5941 
7°6619 
3°3874 
1:2546 
3795 
0889 
0145 
0012 


12°4583 
20°0222 
20°7638 
17°3032 
12°4583 
79941 
4°6342 
2°4375 
1:1607 
4965 
1882 
0618 
“0169 
0037 
“0006 
“0000 


p=3 


18-1318 
30°2198 
27°4725 
16°4835 
64103 
1°2820 


5°5138 
12°9736 
18°2441 
19°4604 
17-0278 
12°5744 
7°8590 
4°0826 
1°6841 
4990 
0811 


30°4094 
35°7757 
223598 
8°9439 
22360 
"2752 


12°1739 
22°1344 
23°7154 
18°9723 
12°2322 
675238 
2°8781 
1:0279 
2827 
0538 
0054 


5°7842 
12°8538 
17°3032 
17°9953 
15°7459 
12:0490 
8°2152 
50297 
2°7663 
1°3589 
5889 
2204 
‘0689 
‘0170 
0029 
0003 


p=4 


10°5769 
24-0385 
28°8461 
22°4359 
11-2179 

2°8846 


2°2'704 
70949 
12°7709 
17:0278 
18°3377 
16°5039 
12°5030 
78590 
3°9295 
1°4190 
2838 


21°4654 
33°5397 
26°8318 
13°4159 
4:1280 
6192 


6°6403 
15°8103 
21°3439 
209694 
16°3095 
10°3613 
5°3965 
2°2614 
*7269 
1615 
0189 


2°5707 
7°4156 
12°4583 
15°7459 
16°4305 
14°7874 
11°7361 
8:2991 
5°2415 
2°9443 
1°4548 
“6200 
"2204 
‘0618 
0122 
‘0013 


p=5 


5°7692 
17°3077 
26°9230 
26°9230 
17°3077 
5°7692 


8514 
3°4056 
7°6625 
12°5744 
16°5039 
18-0043 
16°5039 
12°5744 

76625 

3°4056 

8514 


14°7575 
29°5149 
29°5149 
18°1631 
68111 
1°2384 


3°4783 
10°4348 
17°2997 
20°5034 
18:9958 
14°2469 
8°7064 
4°2644 
1°5991 
4146 
0565 


1:0876 
3°9155 
7°9941 
12°0490 
14°7874 
15°4916 
14°2006 
11°5313 
8°3282 
5°3344 
3°0006 
1°4548 
5889 
"1882 
0426 
0052 


p=6 


9°8383 
24°5958 
30°2717 
22-7038 
10°3199 
2°2704 


1°7391 
6°4073 
12°8146 
18°0913 
19-7873 
1774128 
12°4377 
7°1073 
3°1094 
9423 
“1508 


4351 
1°9033 
4°6342 
8°2152 

11°7361 
14-2006 
14°9480 
13°8803 
11°4309 
8°3350 
533844 
2°9443 
1:3589 

“4965 

“1290 

0181 


p=1 


6°3246 
19-4604 
29-1906 
26°5369 
14°5953 
3°8921 


8238 
3°6613 
8°7226 

14°5376 
18-6566 
19°1897 
15-9914 
10-6609 
54516 
1:9383 
3661 


1631 
8512 
2°4375 
5°0297 
8°2991 
11°5313 
13°8803 
14°6968 
13°7783 
11-4309 
83282 
5°2415 
2°7664 
1°1607 
*3482 
0567 


Successes 


Mm 


Bee 


5 


n=20| 
m=10{ 


S 


SUANAAS SCSMRS TAR Ss tH 


p=0 


80°7692 
16°1538 
2°6923 
3512 
0319 
0015 


67°7419 
225806 
70078 
20022 
“5191 
1198 
"0240 
0040 
“0005 
“0000 
“0000 


58°3333 
250000 
10°2941 
4:0553 
1°5207 
5396 
“1799 
0558 
0159 
“0041 
‘0010 
0002 
“0000 
“0000 
“0000 
“0000 


51°2195 
256098 
12°4765 
5°9099 
2°7154 
1°2068 
5172 
0839 
0315 
0112 
0037 
0012 
“0003 
‘0001 
“0000 
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TABLE—(continued). 


Dl 


64°6154 
26°9231 
70234 
1°2770 
"1520 
0091 


45°1613 
31°1457 
15°0167 
5°9325 
1°9965 
5750 
“1398 
0278 
0043 
“0004 
“0000 


33°3333 
29°4118 
18°7166 
10°1381 
4°9056 
2°1584 


p=2 


51°1538 
33°3612 
12°1313 
2°8884 
4333 
‘0319 


29°5884 
31°7019 
211346 
10°8382 
4°5521 
1°5932 
4618 
1079 
0193 
0024 
“0002 


18°6275 
25°4011 
222259 
15°5343 
9°3206 
4°9495 
2°3569 
1:0101 
3885 
1330 
0399 
“0102 
0022 
“0004 
“0000 
“0000 


12°4765 
19°6998 
20°2323 
16°8602 
12°2839 
80929 
4°9048 
2°7589 
1°4462 
“7070 
3218 
"1358 
0528 
0188 
‘0060 
‘0017 
“0004 
“0001 
“0000 


p=3 


40°0334 
36°3940 
17°3305 
5°1992 
‘9577 
0851 


19°0211 
28°1795 
24°3861 
15-6071 
79661 
3°3250 
1°1335 
*B084 
0636 
“0089 
“0006 


10°1604 
19°0508 
21°5090 
18°6411 
13°4987 
8°4849 
4°7138 
2°3310 
1:0256 
3989 
1353 
0391 
0093 
‘OO1L7 
0002 
“0000 


59099 
12°7783 
16°8602 
17°3419 
15°1742 
11°7715 

8°2768 

533899 

3°1817 

1°7554 
*8965 
4226 
1829 
0720 
*0255 
‘0080 
“0022 
“0005 
“0001 
“0000 


p=4 


30°9349 
36°8273 
220964 
8°1408 
1°8090 
1915 


11°9763 
23°0313 
24°8738 
19-3463 
11°7760 
5°7809 
2°2940 
°7210 
"1707 
"0274 
0023 


5°3977 
13°0590 
18°2826 
19°1232 
16°3913 
12°0203 
7°7053 
4°3590 
2°1795 
9581 
*3658 
1188 
0317 
0065 
“0009 
0001 


2°7154 
75427 
12°2839 
15°1742 
15°6340 
14-0706 
11°3473 
8°3213 
5°5954 
3°4638 
1:9757 
1°0362 
“A974 
2168 
0848 
0293 
0087 
0022 
“0004 
‘0001 
“0000 


p=d 
23°5695 
35°3542 
26°0505 
11°5780 
3°0647 
3831 


7°3700 
17°6880 
23°2155 
21°5333 
15°4158 
8°8091 
4°0375 
1°4571 
3946 
0722 
“0068 


2°7859 
8°3578 
14°1218 
17°4841 
17°4841 
14°7942 
10°849]. 
6°9744 
3°9421 
1°9511 
8362 
3041 
“0907 
0209 
0033 
0003 


1°2068 
4°1377 
80929 
11°7715 
14°0706 
14°5245 
13°3141 
11°0186 
8°3131 
5°7473 
3°6474 
2°1221 
1°1274 
5429 
2345 
0893 
0293 
“0080 
‘0017 
“0003 
“0000 


85 


86 


Successes 


N=25 
m= 5 


n=25| 
m=15{ 


n=25) 
m= 20 ( 


° 


MAS So 2M 


NS 


© 
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p=—0 
83°8710 
13°9785 
1°9281 
*2065 
0153 
“0006 


72°2222 
20°6349 
54622 
1°3242 
‘2897 
0561 
0093 
0013 
‘0001 
“0000 
“0000 


63°4146 
23°7805 
8°5366 
2°9204 
"9472 
"2894 
0827 
0218 
70053 
70012 
“0002 
“0000 
“0000 
*0000 


56°5217 
25°1208 
10°8476 
4°5409 
1°8380 
"7172 
2690 
0965 
0330 
‘0107 
0033 
“0009 
“0002 
‘0001 
“0000 


TABLE—(continued). 


p=1 
69°8925 
241008 
51645 
‘7651 
‘0736 
‘0035 


51°5873 
30°3455 
12°4141 
4°1380 
1:1680 
*2803 
0564 
0092 
‘OO1L 
“0001 
“0000 


39°6341 
30°4878 
16°8485 
7°8930 
3°2888 
1°2403 
"4256 
1326 
0373 
“0094 
0020 
“0004 
‘0001 
“0000 
“0000 


31°4010 
28°5463 
18°9202 
10°8116 
5°6036 
2°6897 
1°2069 
5082 
*2009 
O744 
0257 
“0082 
0024 
“0006 
“0002 
“0000 


p=2 


57°8421 
30-9868 
9*1813 
1:7656 
2119 
0123 


36°4146 
33°1041 
18°6211 
8-0091 
2°8032 
“8119 
"1933 
0368 
0053 
“0005 
“0000 


24°3902 
28°8832 
21°8575 
13°1550 
6°7654 
30643 
1°2381 
4477 
“1444 
0412 
0102 
"0022 
“0004 
‘0001 
“0000 
“0000 


17°1278 
23°8993 
21°6231 
15°8218 
10-0864 
5°7932 
3°0491 
1°4833 
6695 
2806 
“1089 
0390 
0128 
0038 
‘0010 
“0002 
‘0001 


p=3 


47°5131 
35°1949 
13°5365 
32488 
4738 
0329 


25°3798 
31°7248 
23°0261 
12°2806 
5°1875 
1°7786 
4940 
“1086 
“O179 
“0020 
“0001 


14°7625 
23°9392 
23°2742 
17°2894 
10°6788 
5°6953 
2°6697 
1°1072 
“4060 
1307 
0364 
“0086 
0016 
“0002 
“0000 
“0000 


9°1614 
17°4502 
20°2168 
18°1951 
13°8796 
9°3504 
5°6861 
3°1589 
1°6133 
“7592 
*3290 
1308 
"0475 
0156 
“0046 
0012 
“0002 
“0000 


p=4 


38°7144 
37°2254 
17-8682 
5°2115 
9064 
0741 


17°4486 
28°1430 
25°3287 
16°3035 
8°1518 
3°2607 
1:0451 
2628 
0493 
“0062 
0004 


8°7777 
18°2869 
21°9443 
19°5777 
14°2383 
8°8100 
4°7366 
2°2329 
*9240 
3337 
1038 
0272 
0058 
0009 
0001 
“0000 


4°7988 
11°7044 
16°6788 
17°9618 
16°0711 
12-5094 
86871 
5°4604 
3°1317 
1°6449 
“7916 
*3482 
1393 
“0502 
0162 
0045 
0011 
“0002 
“0000 


p=5 


31°2693 
37°5232 
21°8885 
76134 
1°5573 
"1483 


11-8200 
23°6401 
25°6780 
19°5642 
11°4125 
5°2673 
1°9313 
5518 
“1170 
0165 
0012 


5°1203 
13°1666 
189753 
19°9337 
16°8191 
11:9361 
7°2943 
3°8807 
1°8017 
"7266 
"2515 
0732 
0173 
0031 
“0004 
“0000 


2°4579 
73738 
12°5733 
15-8820 
16°4186 
14°5943 
11°4669 
80943 
5°1816 
3°0226 
1°6088 
*7800 
3429 
1357 
‘0477 
0147 
0039 
0008 
‘0001 
0000 


Successes 


n=25| O 
m=25( 1 


n=650 
m= 5 


n=50 
m=10 


12, 1h, 15 


eal) 


50°9804 
25°4902 
12°4850 
59824 
2°8003 
1°2784 
5682 
2453 
1027 
0416 
0162 
0061 
0022 
0007 
“0002 


91°0714 
8°2792 
6133 
‘0347 
0013 
“0000 


836065 
139344 
2°1256 
2932 
0360 
0039 
0003, 
“0000 
0000 
“0000 
‘0000 


77°2727 
17°8322 
3°9008 
8049 
"1558 
0281 
0047 
0007 
0001 
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TABLE—(continued). 


p= 


25°4902 
26°0104 
19°5078 
12°7285 
76094 
4°2613 
2°2598 
J°1411 
5502 
2535 
1115 
0468 
0187 
0070 
0025 
0008 
0003 
"0001 


82°7922 
15°3319 
1°7357 
1335 
“0065 
‘0001 


69°6721 
23°6177 
5°4972 
1:0287 
"1607 
0210 
0023 
0002 
“0000 
“0000 
“0000 


59°4406 
27 8628 
9°2876 
2°5965 
*6385 
*1405 
0278 
0049 
0008 
0001 


p=2 


12°4850 
19°5078 
19°9229 
16°6024 
12°1751 
8°1352 
5°0451 
2°9344 
1°6103 
8365 
4118 
1921 
0848 
0353 
0138 
0051 
0017 
“0005 
0002 
“0000 


75°1263 
21°2621 
3°2711 
3207 
0192 
“0005 


57°8633 
29°9293 
9°4513 
2°2503 
"4296 
0668 
“0084 
0008 
0001 
“0000 
“0000 


455092 
32°5066 
14°6804 
5°2148 
1°5643 
4083 
0939 
0191 
00384 
0005 
0001 


p=3 

59824 
12°7285 
166024 
16°9713 
14°8499 
11°6037 
82883 
54870 
3°3951 
1°9732 
1-0801 
5573 
2709 
-1238 
0531 
0212 
‘0079 
‘0027 
0008 
0002 
0001 


680389 
26°1688 
5°1311 
6157 
“0440 
0014 


47°8869 
33°6048 
13°5019 
3°9278 
“8910 
1614 
0233 
0026 
“0002 
“0000 
“0000 


34°6737 
33°5552 
19°2530 
8°3429 
2°9695 
‘9011 
2371 
0544 
0109 
“0019 
“0003 


61°4967 
30°1454 
7°2349 
1°0335 
0861 
0033 


39°4857 
35°2551 
17°3070 
59827 
1°58038 
3282 
0536 
0067 
“0006 
“0000 
“0000 


26°2849 
32°3175 
22,6222 
11°6306 
4°8127 
1°6718 
*4976 
"1279 
0284 
0054 
“0009 
0001 


55°4676 
33°2805 
9°5087 
1°5848 
1517 
*0066 


32°4346 
35°3833 
20°6402 
8°3080 
2°5164 
5921 
“1085 
0152 
‘0015 
‘0001 
“0000 


19°8214 
29°7321 
24°6927 
14°7589 
6°9910 
2°7465 
"9155 
2616 
0642 
0132 
0024 
0003 
“0000 


87 


88 


Successes 


n=50| O 
m=20{ 1 


DW MVNA AW Ss % 


n= 50) 
m= 50{ 
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p=0 
71:8310 
20°5231 
5°6513 
1:4959 
3796 
0920 
0212 
0046 
0010 
0002 
-0000 


67°1053 
22°3684 
7°2546 
2°2857 
“6984 
*2066 
0590 
0163 
0043 
‘0011 
0003 
“0001 
“0000 
“0000 


50°4950 
25°2475 
12°4963 
6°1206 
2°9657 
1:4210 
6731 
3151 
*1457 
0665 
0300 
0133 
0058 
“0025 
‘OO11 
“0004 
“0002 
“0001 


TABLE—(continued). 


p=1 

51°3078 
29°7437 
124661 
4°4655 
1:4377 
*4247 
“1161 
"0295 
‘0070 
0015 
0003 
0001 


44°7368 
30°2276 
14°9068 
6°3492 
2°4592 
8853 
2994 
0956 
"0289 
0083 
0022 
‘0006 
‘0001 
“0000 
“0000 


25°2475 
25°5026 
19-1269 
12°6198 
7°7231 
4°4875 
2°5063 
1°3552 
"7126 
3654 
1831 
‘0898 
0431 
0203 
0093 
“0042 
“0019 
“0008 
0003 
‘0001 


p=2 
36°4360 
32°1494 
18-2340 
82882 
32515 
1°1380 
3613 
“1049 
0279 
0068 
0015 
0003 
0001 


29°6230 
30°4346 
20°2898 
10°9546 
5°1643 
2°2004 
8629 
3146 
1073 
0343 
0103 
“0029 
0008 
“0002 
“0000 
“0000 


12°4963 
19°1269 
19°3241 
16°1034 
11°9504 
81873 
5°2821 
3°2480 
1°9185 
1°0942 
“6049 
*3250 
"1699 
0866 
0431 
"0209 
0099 
0046 
0021 
“0009 
“0004 
“0002 
‘0601 


p=3 
25°7195 
30°7099 
22°1018 
12-2410 
56902 
23122 
"8391 
‘2751 
"0820 
(0222 
‘0055 
‘0012 
0002 
“0000 


19°4782 
27:0530 
22°8617 
15°0234 
8°3826 
4°1420 
1°8546 
"7627 
2904 
"1029 
0340 
0105 
“0030 
0008 
“0002 
“0000 
“0000 


6°1206 
12°6198 
16°1034 
16°2729 
14°2388 
11°2686 
8°2677 
5°7108 
3°7517 
2°3606 
1°4298 
8367 
4743 
2610 
1396 
0726 
0368 
‘0182 
0088 
0041 
0019 
“0008 
0004 
“0002 
“0001 
“0000 


p=4 
18°0421 
27°3365 
23°9720 
15°7316 
8:4901 
3°9438 
1°6163 
5926 
"1959 
0585 
‘0158 
‘0039 
“0008 
“0002 
“0000 


12°7149 
22°3854 
23°0250 
17°9083 
11°5877 
6°5376 
3°3018 
1°5167 
6398 
2494 
“0901 
0302 
0094 
0027 
‘0007 
“0002 
“0000 
“0000 


2°9657 
7°7231 
11-9504 
14°2388 
14°3919 
12°9527 
10°6753 
8°2014 
5°9437 
4:0975 
2°7034 
1°7147 
1:0490 
*6205 
3557 
"1978 
“1068 
‘0561 
0286 
0142 
"0069 
0032 
‘0015 
0007 
“0003 
“0001 
“0000 
“0000 


p=s5 
12°5748 
23°2150 
24°1218 
18-3785 
11°3384 
59480 
2°7262 
1:1089 
-4039 
"1323 
0390 
0103 
0024 
-0005 
0001 
0000 


8°2378 
17°6525 
21°4900 
19°3831 
14°3204 
9°1130 
5°1406 
26162 
1:2147 
“5181 
2038 
0741 
0249 
0077 
"0022 
“0006 
‘0001 
“0000 
“0000 


1:4210 
4°4875 
8°1873 
11-2686 
12°9527 
13°0951 
12-0038 
10°1734 
8:0780 
6:0662 
4°3381 
2°9694 
1:9531 
1:2381 
"7582 
4493 
2580 
1437 
0777 
“0408 
0208 
0103 
0050 
0023 
‘0010 
0005 
“0002 
‘0001 
“0000 


Successes 


aa 0 
m= 10({ 1 


SDTAN] At Ss & 


ioe 


SUMUANAAS SMHS 


ia’ 


Successes 


n=100 
m— 9 


MA Os WHO 


me 
SOM ANAASK WONRD 


11—15 


p=0 
90-9910 
82719 
"6830 
‘0506 
‘0033 
“0002 
“0000 


p=10 


33°5855 
36°9441 


20°1513 
771284 
1°8005 
3376 
0474 
0049 
‘0003 
“0000 


p=0 


95°2830 


4°5373 
1745 
0051 
“0001 
“0000 


87 ‘0690 


11°3568 
1°3947 
1604 
0172 
‘0017 
“0002 
“0000 


83°4711 


13°9119 
2°2212 
3388 
"0492 
‘0068 
0009 
‘0001 
“0000 


p=1 
82°7191 
15-1778 
1°8972 
"1891 
‘0156 
‘0011 
‘0001 
“0000 


p=15 
19°6056 
33°0200 
26°8727 
13°8698 
5°0127 
1°3220 
‘2571 
0363 
0036 
“0002 
“0000 


p=1 
90°7457 
8°7256 
5083 
0199 
“0005 
0000 


Tiae7AAl 
19°9243 
3°7027 
*5730 
0774 
0093 
0010 
‘0001 
“0000 


69°5592 
23°3813 
5°6472 
171584 
2122 
0354 
0054 
0008 
‘0001 
“0000 


Biometrika 1x 


M. GREENWOOD 89 


TABLE—(continued). 


75°1302 68°1737 61°8023 55°9719 50°6412 45°7719 41°3280 
20°8695 25°4855 29°1520 31°9839 34-0855 35°5510 36°4659 
3°5107 5°4097 7°4962 9°6874 11°91384 14°1158 16°2472 
“4416 *8243 1°3455 2°0065 2°8031 3°7269 4°7658 
"0442 ‘0971 *1829 *3098 "4857 "7174 1:0109 
‘0036 “0090 0194 ‘0368 ‘0641 "1044 *1609 
“0002 ‘0007 “0016 “0034 ‘0065 ‘0115 ‘0194 
“0000 -0000 ‘0001 “0002 “0005 ‘0010 ‘OO17 

—= = ‘0000 “0000 -0000 “0001 ‘0001 

— — —_ — — “0000 “0000 
p=20 p=25 p=30 p=35 p=40 p=45 p=50 
11°0992 60712 3°1945 1°6083 “7697 °3473 “1463 
25°8982 18°5708 12°3788 7°7198 4°5082 2°4580 1°2434 
28°8081 26°8613 22°5639 17°3696 12°3485 8°1229 4°9313 
20°0784 24°1644 25°4567 24°1112 20°8229 16°5036 12°0167 
9°6930 14°9554 19°6711 22°8554 23°9308 92°8255 19°9224 
3°3813 66468 10°8709 15°4516 19°5797 22°4513 23°4799 
*8619 271464 4°3483 7°5418 11°5470 15°9030 19°9224 
*1583 *4968 1°2424 2°6232 4°8456 8°0093 12°0167 
“0200 ‘0788 "2495 6221 1°3845 2°7446 4°9313 
*O015 ‘0077 0292 ‘0908 2.432 5778 1°2434 
‘0001 “0004 ‘0017 “0062 °0199 0567 *1463 
p=2 p=3 p=4 p=d p=6 p=T p=8 p=9 
86°3830 82°1896 78°1607 74:2914 70°5768 67°0123 63°5933 60°3153 
12°5800 16°1156 19°3467 22:°9874 24:°9514 27°3520 29°5020 31°4142 
‘9867 1°5956 2°3216 3°1517 4:0737 5°0756 6°1463 7°2749 
0488 ‘0957 "1642 2573 *3780 "5287 BAL 9287 
‘0015 0034 ‘0067 ‘0119 ‘0197 0306 *0454 0649 
“0000 ‘0001 ‘0001 ‘0003 “0004 ‘0008 0013 *0020 
65°7500 57°0221 49°3852 42°7116 36°8873 31°8110 27°3928 23°5527 
26°1836 30°5476 33°3684 34°9458 35°53836 35°3456 34°5611 33°3293 
6°5459 9°6321 12°7407 15°7096 18°4248 20°8110 22°8233 24°4415 
1°2777 2:2767 3°5456 50426 6°7156 8°5076 10°3611 12°2207 
“2091 "4386 “7879 1:2724 1°9006 2°6738 3°5865 4°6273 
“0295 0715 *1458 2641 *4381 6787 ‘9959 1°3973 
‘0037 ‘0100 "0229 ‘0461 “0842 *1428 "2278 *3459 
‘0004. “0012 ‘0031 ‘0068 ‘0137 0252 0435 ‘O711 
‘0000 ‘0001 “0004 “0009 ‘0019 ‘0037 ‘0070 ‘0122 
— ‘0000 “0000 ‘0001 “0002 “0005 “0009 ‘0017 
= = — 0000 0000» -0001._:~S0001._—S “0002 
= = = = -— 0000 =~ -0000-—-0000 
57°8686 48°:0604 39°8449 32°9751 27°2403 22°4613 18°4859 15:1848 
29°4247 32°8618 34°3491 34:4088 33°4530 31°8036 29°7094 27°3600 
9°5567 13°4563 17°0252 20°0718 22°4994 24°2787 25°4270 25°9920 
2°4716 4°2124 6°2724 8°5261 10°8479 13°1236 15°2562 17°1690 
*5480 1°:0993 1°8873 2°9118 4°15385 DD Mho: 7°1382 8°7832 
‘1077 *2490 "4853 *8394 1°3291 1°9649 2°7495 3°6775 
‘O191 “0500 “1093 *2099 *3658 *5913 “8994 1°3010 
‘0031 ‘0090 0219 "0462 ‘0881 *1547 *2545 *3965 
“0004 ‘0015 ‘0039 “0090 ‘0187 0356 0630 *1053 
‘0001 “0002 “0006 0016 ‘0035 ‘0072 0137 “0245 
‘0000 ‘0000 ‘0001 “0002 ‘0006 ‘0013 ‘0026 ‘0050 
= — -0000 ‘0000 ‘0001 “0002 “0005 “0009 

= = = —_ “0000 “0000 ‘0001 ‘0001 
= = = — — —- “0000 “0000 


12 


p=9 
37°2763 
36°9072 
18-2690 
59052 
1°3708 
2374 
0309 
0030 
“0002 
-0000 


p=10 
57°1739 
33°1007 
8°4512 
11813 
“0899 
*0030 


20°2198 
31°7739 
25°6636 
14°0361 
5°7796 
1°8884 
*5036 
1112 
0204 
0031 
“0004 
“0000 


12°4488 
24°8976 
26°0397 
18°8065 
10°4578 
4°7356 
1°8040 
“5898 
‘1675 
0416 
‘0091 
‘0017 
0003 
‘0000 


90 

Successes p=0 
n=100) O 80°1587 
m= 25} 1 16°0317 
2  3:1029 
3 "5802 
4 "1046 
5 0182 
6 0030 
7 “0005 
8 ‘0001 
9 0000 

10 

11 — 

12 — 

13 _— 

14 — 

15—25  — 


n Ea 0 66°8874 


m= 50j 1 22°2958 
2 7°3322 
8 2°3780 
4 *7603 
oO: 2396 
6 0743 
tf 0227 
8 ‘0068 
9 “0020 
10 ‘0006 
11 “0002 
12 = 
13 == 
1h a 
1) == 
16 = 
17 — 
18 == 
19 = 
20 = 
21 — 


j=l 
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p=2 


TABLE—(continued). 


64°1270 51°1981 40°7920 32°4330 
25°8576 = 31°2184 


7°5681 
1°9024 
4323 
0908 
0178 
00383 
0005 
‘0001 
“0000 


44°5916 
29°9273 
14°8625 
6°4708 
2°6038 
“9912 
3614 
1271 
0433 
0143 
0046 
“0014 
0004 
‘0001 


12° 
3° 
1: 


2826 
8912 
O701 


"2644 
‘0597 
0125 
0024 
“0004. 
‘0001 
“0000 


6280 
0284 
0189 
9693 


3333 
"3852 
‘0008 


"3987 
"1520 
0557 
0197 
0068 
0022 
“0007 
“0002 
‘0001 


p=3 p=4 
33°4361 33°5052 
16°5799 = 20°1081 
6°3556 = 90661 
2°0562 3°3806 
5855 —- 10922 
1501 3138 
0351 0815 
0075 0193 
0015 0042 
0003 “0008 
0001 ‘0001 
“0000 “0000 
19°6185 12-9456 
26°6919 22°1671 
22°3956 22°4728 
14°8274 17°4789 
84691 11°4896 
4°3589 66996 
2°0720 ~~» 3°5636 
9237 = 1°7600 
3901 8167 
1572 “3590 
0607 "1504 
0226 0603 
0081 0232 
0028 0086 
0009 0031 
00038 0011 
‘0001 0004 
= ‘0001 


p=5 
25-7320 
32°1649 
22-7047 
11°8013 
49929 
1:8077 
5764. 
1647 
0426 
0100 
(0022 
0004 
0001 
0000 


8°5121 
17°61138 
20°9746 
18°7745 
13°9817 
9°1228 
5°3759 
2°9173 
1:4771 
“7044 
3185 
1373 
0566 
0224 
0085 
0031 
0011 
0004 
‘0001 


p=6 
20°3711 
29°9575 
24°3723 
14°3734 
6°8150 
2°7378 
*9606 
*3000 
"0844 
0215 
“0050 
‘OO11 
“0002 
“0000 


5°5769 
13°5550 
18°5789 
18°8406 
15°7005 
11°3492 
73484 
4°3512 
2°3900 
1°2301 
5978 
2758 
"1213 
0510 
0206 
"0080 
0030 
‘0011 
0004 
“0001 


p—l 
16°0915 
27°2737 
25°1'757 
16°6391 

8°7536 

3°8700 
1°4841 
“5035 
"1531 
0421 
0105 
"0024 
“0005 
‘0001 
‘0000 


3°6405 
10°1832 
15°8126 
17°9434 
165656 
13°1572 
9°2958 
5°9710 
3°5398 
1:9578 
1:0184 
“5012 
2345 
1046 
"0447 
‘0183 
0072 
0027 
“0010 
0003 
0001 


p=8 
12°6823 
243890 
252300 
18°5020 
10°7117 
51757 
21565 
‘7910 
‘2589 
‘0763 
"0203 
‘0049 
0011 
‘0002 
‘0000 


2°3676 
7°5029 
13°0370 
16°3894 
16°6252 
14°4085 
11°0430 
7°6559 
4°8771 
2°8874 
1°6022 
8386 
*4161 
*1965 
“0886 
0382 
0158 
0063 
0024 
“0009 
0003 
“0001 


p=9 


9°9724 
21-4922 
24°6693 
199086 
12°5970 
6°6134 
2°9790 
1/1761 
*4127 
"1299 
0369 
0095 
0022 
0005 
‘0001 
“0000 


1°5339 
5°4395 
10°4710 
14°4635 
16-0095 
15°0512 
12°4505 
9°2753 
6°3248 
3°9946 
2°B3574 
1°3088 
6871 
3425 
"1627 
0738 
0320 
0133 
0053 
0020 
0008 
0003 
‘0001 


p=10 


7°8232 
18°7076 
23°6306 
20°8423 
14°3291 
8°1327 
3°9431 
1°6693 
6260 
2099 
0634 
0173 
0043 
0009 
0002 
“0000 


9900 
3°8892 
8°2261 

12°3988 
14°8876 
15-1066 
13°4281 
10°7081 
7°7895 
5°2324 
3°2752 
1°9239 
1°0664 

5601 

2797 

1332 

‘0606 

0264 

0110 

0044 

‘0017 

“0006 

0002 

0001 


ON THE PROBABLE ERROR OF THE CORRELATION 
COEFFICIENT TO A SECOND APPROXIMATION”™. 


By H. E. SOPER, M.A. 


(1) Iv is very important in determining whether the coefficient of correlation 
as found by any particular method differs significantly from the calculated value to 
know not only its standard deviation but also to have some idea of the nature 
of the frequency distribution. When the numbers dealt with are large, then, 
provided r be not nearly +1, we may quite legitimately assume a normal 
distribution and calculate the frequency of r on this basis. But if n be small, 
or if r have a value near either end of the range, then the usual values for the 
8.D. of r are not applicable and what is more in the latter case the frequency of r 
is of a markedly skew character and differs widely from a Gaussian curve. In such 
case the value of r found from a single sample will most probably be neither the 
true r of the material nor the mean value of r as deduced from a large number 
of samples of the same size, but the modal value of r in the given frequency 
distribution of r for samples of this size. In this paper the following notation will 
be used: 


p=correlation coefficient of the material from which the sample is drawn ; 
7 =mean value of correlation coefficient for V samples of size n; 


* = modal value of the correlation in the distribution of the values of 7 as found 
from WV samples of size n ; 


r =correlation coefficient of any arbitrary sample of size n. 


The first question we have to answer is what is likely to be the distribution 
of the r’s. Clearly, when p differs from unity, it must be a skew distribution of 
limited range lying between +1 and —1. The general skew curves discussed 
in Phil. Trans., Vol. 186 a, pp. 343—414, have proved themselves so capable of 
describing all sorts of types of frequency that one naturally turns to them in the 
first place in the present problem. There appears very little chance of successfully 
determining-—at least for a product-moment table—the distribution of r, We 
must start with the assumption of a reasonable frequency distribution-and justify 

* The frequency-distribution of the correlation coefficient in small samples was first discussed by 


“Student” in his paper in Biometrika, Vol. v1. pp. 302-10; he invited further mathematical investiga- 
tion and to a large extent supplied the impulse and direction to the present paper. 
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it a posteriort by means of experimental samples for given p and given n. Now 
the only type among the skew curves mentioned applicable in the present case 


is of the form : 
gym ae\me : 
Giro (1 = =) (1 ar =| Hudnedodoanepnebue nogoo000 (3), 


where, if the origin be at the mode, we must have 


My Oy M3] Og sieac gee cee de seterenee cea eee (i1) 


Now if we suppose p to be positive, we clearly have 


a=1—-7, a,=1+%. 


=i —— +1 
Cc 


Hence from (1) and (11) 


x my “ Mg =. 
Y= Yo (1 = i) (1 a 7) ecco c ces ccerevercece (iii), 
F= (mM, —M))/Gng +m) <2 iicce scene eeee eee (iv) 


Now let o, denote the standard-deviation of the distribution. Then we easily 
find (Phil. Trans., loc. cit. p. 368) 
ga 2hm+) ae - 


= tao Defeats ae euinase cctnce ost CREE (v), 
of= hen, ne a je (vi). 
Thus T= (Mm, =, (Ny Eg 2) Ses ce cee ee eae ee eee (vil), 
1—-r=4(m, +1) (m4 1)/(m + m, 4+ 2), 
and Cf = (= 77) Gry Hg eB) eta nec clecte io acre eee (viil). 
It follows that 
m+m,+ 3 =*=F ay, say, 


m— my =F PoP 1b =F =D). 


2 


Oy 
Accordingly m=s(A-1)1—7)-1 p 
ih =4 2D | sjasiclacton «ce ae eee nee (ix). 
Substituting in (iv) we have 
F=F7 (A-1)/(A-8) 
ae 52) ee Seiseiseisleins Caines nen meen (x): 
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Since o,2=(1—7)/A, and must grow very small as the number in the sample 
grows large, i.e. X grows large, we see that * and 7 rapidly become equal as the 


sample increases or the distribution becomes symmetrical. 
The value of y, can be found from (Phil. Trans., loc. cit. p. 369) 


a my” Mg" (m, +m,+1) I (m, + m,+4 1) (xi) 
(m, + My) tM T (m, a 1) Tr (mn, rs 1) Se iSa tetera i 


The problem of the distribution of r would thus be completely solved, if we 
knew : 


Yo = 


(a) r in terms of p, 
(b) o,? in terms of p. 


Using Stirling’s Theorem we can reduce the expression of y, to 


- 1 ee ts 1 1 
A (m +m,+ 1) Vm, + mz ae my) git mytM, Mm =a) 
2 Ir Vm, My 


tee tall +a) Omen) O+memeD 
~ / 2a, a/ =) +5) Uti a = m+m,+1 
1 1 1 1 
x el2 (= +m, my =) 


oy ee oer at) (xii) 
= orca {t+ ia lie + amg Tom ama) f ores 


This approaches rapidly to the Gaussian value J, |(V270,), if co, be at all small 
and therefore X and m, and m, large. For most purposes it is sufficient to take 


N ; g \2A-1)(1-7)-1 i ge \2Q-1)(1+7)-1 i 
=75-7( - 75) ( +73) Saene (xiil), 


gos 


where the relation between * and F is given by (x) and it only remains to 
determine 7 and o, in terms of p. 


(2) Now the product moment value of the coefficient of correlation, p, between 
two measured characters in any population is defined by 


Pu — P10 Por 7 

pare (Dao — Pio) X (Poe — Por") 

Pro, Px» being the first and second moments in respect of the first character, 

Pu» Po those in respect of the second character, and p, being the first product 

moment, all derived from measures of individuals taken from some arbitrary 
origins of measurement in the two characters. 


If samples of number n are selected at random the moments will have 
different values py, Po etc. and in consequence the coefficient of correlation a 
different value, r, in any sample, and 


Di — Pw Dor 
ae, == io ayi n/ (Poo — Pu’ 3) 
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The mean values of Py’, Po etc. in all samples are py, Po ete., since the 
moments are crude and simple averages of individual values. Let dp, px, 
dp, Ape, Ap, be the deviations of py, Po, Pus Po Pu from their means pj, Po, 
Pu» Po, Pu. The mean value of r we have called 7, Let dr be the deviation of r 
from its mean 7, then (xv) becomes 


. Put dp ae (Pro a Apo) (Pa te Apo) (xvi) 
n/ | Poo Ste Ap = (Pro or dpy)*} x NV {Poo ar Apoo as (Po + dpa)’ <a ; 
Choose the fixed origin of measurement of each character to be the mean 
of that character in the whole population, then p,, =p. =0, and (xvi) becomes 


Put dpu = Apo Adpor 

A { Poo Ur Apo aa (dpryo)*} x {Poe Ur Apoo = (dpu)?} 

If the distributions and correlations of the deviations of the moments in 
samples of x are known this is the equation for determining the distribution of 
the values of the correlation coefficient. The average value of the right-hand 
side of (xvii) will be 7 The average values of the square, cube etc. of the 
right-hand expression will give the crude second, third etc. moments of r from 
which the moments of deviations from mean value of the correlation coefficient 
can be derived. 


Tr+dr= 


f+dr= 


Now if (xvii) be expanded in powers and products of the deviations it may 
be anticipated that the average values of terms of higher order in the deviations 
are of higher order in 1/n, and that there is a limit to the number of terms needed 
to give a required approximation. The approximation sought in the crude moments 
of r is to terms in 1/n? only, in order that the moments from the mean may be to 
terms in 1/n?, and so that o,? for instance, which is known* to have the value 
(1 — p*)*/n to the first approximation for normal frequency, may be further carried 
to a term in 1/n*. 


Thus the process for determining 7 is to expand (xvii) and to find and insert 
in it the average values in samples of n of the various powers and products of the 
deviations of the moments involved, carrying the process on as far as is necessary 
to gather in all significant terms as defined above: and a similar process applied 
to the squares and cubes ete. of (xvii) determines the higher moments of 7. 


Were the samples sufficiently large these deviations would approximate, as has 
been shown, to normal distributions, and the known properties of such distributions 
could be utilised in evaluating the complicated mean, but we are dealing with 
small samples where the deviations are not so distributed, and it is necessary, 
in the first place, to evaluate these moments of deviations in terms of the higher 
moments of the whole distribution without making any assumptions or any 
approximations within the limits assigned. After this is done the distribution 
of the two characters will be assumed normal and the results expressed in terms 
of p, the coefficient of correlation of the material examined, and n, the number 
in the sample, only. 

* Biometrika, Vol, 1x. p. 5 (if B=,’ =3). 
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The method adopted in this paper is that of grade groups. It is well known 
that if in an indefinitely large population the fraction / fall into a certain grade 
of a character or combination of grades of two characters, then in taking random 
samples of n the numbers of this group to be found in such samples follow the 
binomial distribution of frequency 

n(n—1), p 
ie + nye el —f) ar ue Yea Cl --f/P +...4¢(1 —f)", 
and that the mean number is nf and that the deviations from this mean number 
have moments 


mean (df= ~ f(1—f), 

» @=5fA-Na-%), 

» UP =2PA-fP+ 5 fA A-—OF + GP, 

» fP=SAA-fPA- 2) + SAA fd 3) — 14¢ +197, 


etc. the fourth moment being the last which gives terms in 1/n? [see Pearson, 
Phil. Trans. Vol. 186, p. 347 and Phil. Mag. 1899, pp. 240, 241]. Here df is the 
deviation from mean value f of the frequency of the group in a sample of n. 

Moreover if fi, fi, fs... are the totality of frequencies of the various detached 
groups into which the population is divided by the graduation (which in our case 
is a double one) of character the various product moments of the deviations in 
samples of n may be deduced. These and the above, as far as our approximation 
needs, are put in one table as follows: 


mean (df,P = =i (1—f) etc. ...(xvili), 
; af. df.=—* ff, » eeen(atix), 
; Gyan) an) ee (xx), 
2 i= -S AA = 7) 5 essex, 
tp taped de Aye » Sey 
, (df, = oh (1A oe eeciint 
» GY. G=-SKAA-A cay 
iy Gy RO fe ah) yxy), 
» fy dfs. Ufr=— = fff —3fd) ay 


» Of, afr. af,. df, = ad etsfs yo peer XR VI), 
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the last five values being approximate and wanting terms in 1/n? to render ine 
exact. 


The method of derivation of the product from the power moments is illustrated 
in the following example. 


Mean (df,)’. df2.df; = mean {(d/,)? x mean df,df, for constant df}. 


Now in samples of constant df, the number of 1’s is nf,+ndf, and of not 1’s 
n—nf,—ndf,, amongst which latter restricted population in the whole community 


Si 


the frequency of 2’s will be = and of 3’s Tae Hence the mean number 
~Sf/1 wohl 
of 2’s in such samples will be (n— nf, — ndf,) 7 and of 3’s 


(n — nf, — ndf,) a 5 


differing from the mean numbers in all samples, nfi, nfs, by —ndf, Ss and 


1—f, 


—ndf, — and the mean product of the deviations from such means in the 


ae 
restricted samples will be 


—(n-nf,—ndf,). Ay a, 


by (xix). It follows that the mean product of the deviations ndf,, ndf;, which 
are measured from the means of all samples, will be 


~(n= nf ndf) ag hat (- ndf,— 4) (- ndfi 5 4). 


: _, Jods hs ‘ Paatis 
1.e. NF a3 + Cee Fy + n? (df) fy 


in samples of constant df. Dividing by n? we get the value of mean df,. df, for 
constant df, and so obtain finally 


(fy pots , (AY —Sofs 1 alate 
mean (df,)? df,df; = mean — - Te Gl — frp + (GA) GAY’ 


which by (xviii), (xx) and (xxiii) 


8 ee Yaa a Sees 
_ pedidels + a5 1-f, t+ iadthehs 


Se Py 


to our approximation. 


The other formulae were arrived at in like manner but the process is lengthy 
and these formulae and the general formulae that follow have been verified by a 
shorter process, which however being less direct in method is not introduced in 
this paper. 

There is no necessity to take the products of the deviations more than four 
together, for these do not give terms in 1/n*. Did any products, five together for 
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instance, give terms in 1/n?, then mean (df, + df, + df; + df, + df;)” would have such 
terms, which is contrary to the formula arrived at above for mean (df). 

Having obtained the various mean products of deviations of group frequencies 
shown in equations (xviii)—(xxvii) the mean products of deviations of moments, 
formed by associating with such group frequencies their grade values, follow. 

Let a, a, ... be the values to be assigned to the grades 1, 2... in the 
formation of the moment p (these values will in the present case be the product 
of one power of one grade of one character with another or the same power of 
one grade of the second character). Then 

p= af, A Ay fo + as fs Aone 
and if ay’, a ... are the values proper to a second moment, p’, in like manner 
p=afitasfatasfrt..., 
and if in random samples of n deviations df,, df, ... in the frequencies lead to 
deviations dp, dp’, in the moments, all deviations being taken from the above 
universal values which are also the mean values in samples, then 
dp = a,df, + andf, + asdfs t+... 
dp’ =a df, + a df, + asdf; t+... 
and so 
mean dp. dp’ = mean [a,a,' (df, + azay (dfir +... 4 aa, dfidf, + a a,df,df, + ...] 


i : : ee ; ey jo OR 
=F [aa/ Al —f)+aa/fA—fi)+...-aa fA fr—a afi fa—-.-] 


1 nes won : : a Rs 
== [aay fitasae fot... —(afi + Aefot...) (Qi fit de fot -.)]. 


If then p is the u, vy moment defined by 

Diy = 1 OL fan HO Oa fia + On Ds” for + 2+. 
obtained by summing the products of the group frequencies f by the wth power of 
the grade value a of the first character and the vth power of the grade value b of 
the second character in that group; and p’ is the w’, v’ moment defined by 

Didg Op O fate! Us fist Cae Ob fig tea ee 
it follows that the first term in the above square brackets is 
GU De fin ats he BE? Fag ess. 

OY Puyu'v+, and the general formula for the mean products two together of 
deviations of grade moments is* 


1 ; mee 
mean dpuy. dpury = - Diary = Dur Dita. wetewasscoss (Xxvill). 


* See W. F. Sheppard, ‘‘On the application of the theory of error to cases of normal distribution 
and correlation,” Phil. Trans. 1899 (192 A), in which paper (p. 127) are given formulae for the mean 
products, two together, of errors of moments calculated from the means of samples. In the present 
paper, it should be noted, the moments of the samples are crude, being calculated, not from the means 
of the samples, but from the mean values of the measured characters in the whole population; and dp 
is the deviation in the value of the crude moment in any particular sample from its mean value in all 
samples, which is mean ay (f,+ 4/1) +42(fot+dfo) +...=Gfita@fot+...=p or the moment in the whole 
population. This latter is a true moment, the general means having been taken as the origin of 
measurement. 
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It will be observed that nothing in the proof prevents w’, v’ from having the 
same values as u, v and the formula is true for any second order moment whether 
power or product. 


In like manner if p, p’, p” are any three moments of the material sampled we 
have the equations of deviations 
dp = adf, + a,df,+ a;,df;+ ... 
dp’ = a df, + dy dfo+ a; dfst+... 
dp” = ay" df, + ay’ df, + az’ dfy+ ... 
giving 
mean dp.dp’ .dp” = mean [a, 1a," (df,)’ + ... all grades 
+ (aay ay” + a ay dy + Ay’) (Af)? Uf 
+ (dy Ae Ay” + ay! Age” + 0, Ade’) df, . (df, + ... all pairs 
+ (Ay Ay” + A, Ae" ds’ + Oy’ Aes” 
+ Ay ty’ dy + Oy Ags + a," ay a5) Af, .df,.df;+...all triads], 


and inserting the values from equations (xx), (xxi), (xx11), 
mean dp.dp’.dp” = = [a,a, a A(li—fA)C — 2A) +... all grades 


— (a0; ay” + ay ay dy + Qid,a2)) Afr — 2h) 
— (0, Ay Oy” + Oy! Ay” + A, Ae) fi fa (1 — 2ft) — ... all pairs 
+ (Cy Ay Az” + A, Ay” Ay + Ay’ hy” 

+ ay Oy" ig + Oy” Ags’ + Ay” Ay 3) 2f, fo fg +... all triads], 


and collecting terms of first, second and third degree in /’s and suitably 
commuting the /’s and a’s this is seen to be 


1 es 
ae [(a,a, a" fit...) 


—(aqaq fit...@"fAt...)—Qa’ ft...) (av fAt...)—(Q aft...) (Gfr+-) 
+2a fit.) Get. ft )h 
the sums being for all grades. 


If then p, p’ have the double grade values pyy, Pu previously assigned and 
p” stands in the same way for pyy where 


Pu’ = ay by” fa ar ay” be” fis A bg by” fos ives 


there results the general formula for the mean products three together of deviations 
in moments as follows 


mean Apyy. Apyry.d aay, 
/} Uv* Pu vs Pu'v" x. n Bi apayee yp i ins Putw v+v' Pur" 


— Putu’ vv" Pu — Pu't+u'v'+v" Puw a PA ey Ore Purr’ spsteraiane (xxix), 
where, as before, the values of the suffixes may be any the same and the formula 
gives power moments equally well with the product moments of the deviations. 
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Precisely the same process evaluates the mean products four together. We 
shall have, putting in representative terms only of each series, and using equations 
(xxill), (xxiv), (xxv), (xxvi), (xxvii), 


1 ; ; 
mean dp.dp'.dp”.dp’” = = [a,a, a," a," 3/2 (1 =A +... 


= 004" tg" Bf? 1 — fi) fo + ++ 

+ 40/019) ty" fifa —fi = fo+3fi fa) + + 
= 0,0; Aes” fi fo fs (1 — 3f\) — -.. 

+05 03°04 fi fofsfrt ---] 


ill , at LA Aa bed 
= a L(t Be eee) Og faectniecs ) 


A (QiGy fats) (0, Qy fares.) 

+ (aa fit...) (ai a" fit...) 

(Cr aad ee Ga a es 
—(qq" fit... Gaft...)G’fArt... 
—(qa,"fit+...) (a fit...) (a At.. 
=—(a,@ fit...) (aft =.) GQ fit «. 
=(@0, ft.) Gh...) G At. 
= (a a’ fit. aft...) (a’ fit... 
+3(M fit...) (av At...) ("A+ ...) (a fit ...)] 


on collecting terms and rearranging the associations of /’s and a’s as before. 
And putting into factors this 


lS 


1 , aw bd wt WL if Nf 
=—[{((aa/At...)-—G@ft...)(a’ At...) (qr aft...) — (fit...) (afi +...)} 


n 
+ {(aa"fAt...-—G@fAt...) "At. (aa fi +...) —-(@’At...) a” fit...} 
+ f(a” fit..J—(Gfit... a fit...) {(q’a"fit...)—(al fit... Ma" At...)}} 


And so again if the material is double graded and py, Puy Pw'v’, Pur” are any 
four moments involving products of powers of both grades, the general formula 


for the mean products four together of the deviations of such moments in samples 
of n becomes 


mean dpuy. Apu »dpury - Upury 


1 
= n pit tareey. a Puv Pu'v') (Pu'pw v0" Pu’ vy” Pur vy”) 
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eke : ‘ es) | j 
and it is to be recalled that this formula omits terms in A not wanted to the 


degree of approximation laid down. 


Comparing (xxx) and (xxvill) it appears that the mean values four together 
are equal, within our degree of approximation, to the sum of the products of the 
mean values two together of the complementary pairs making the four, the 
division being possible in three ways. 


mean A puyd Pury dpury pun = mean d Puy dpury x mean dpury 1 bpu'y! " 
+ mean dp,,dpyy X mean dpyydpyy 
+ mean dp,,d pyr X mean dpyy Apyry ++(XXX1). 


It is unnecessary to find the general formula for the mean products of 
deviations five together, which by p. 96 will contribute nothing within our 
approximation, and formulae (xxviii), (xxix) and (xxx) applied to the expansions 
of (xvii) and its powers are sufficient to evaluate the general formulae for the 
mean and moments of deviations of r as far as terms in 1/n?. 


It is not proposed to exhibit these general formulae for moments of devia- 
tions of r in terms of the higher moments of the given distribution at length, 
but to proceed at once to the simpler case of a normal distribution in the two 
correlated characters and reduce the higher moments to second moments and the 
coefficient of correlation, p, as such distributions, it is well known, admit. In 
order to reduce in this way the values of the various mean products at the same 
time that they are evaluated by the formulae (xxviii), (xxix), (xxx), the neces 
formulae of reduction are next obtained. 


The expression for dr involves dp», dp», dp, Apo. and dp,, and (xxix) shows 
that we shall require to reduce p, Ps: ---Ps0-+-Pso-+-Pso--- in the above way. 


Now it is well known that in normal distributions, following the Gaussian 
law of frequency, the odd moments from the mean in either character vanish 
and the even moments are derived from the second moment by a simple 
formula of reduction, from which there results that 


P50 = P30 = P10 = 0, Pos = Pos = Pa = 0, 
Po = 3D", Pow = 5. 3Poy', Ps = BDoo, Pos = 9. Bor’. 


And, utilising these results, the higher product moments of normal distributions 
in two characters may be derived from the first product moment and the second 
moments by two well-known properties of the Gaussian surface. 


If x, y are deviations from their mean value of two normally correlated 


characters the mean value of y for a given w is Puy and if y’ is the deviation 
Dp: 


20 
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of y from its mean value in the array the distribution of y' is normal; its second 


— Pa oa and its higher moments follow the same laws of reduction 
as above. Hence 
Pw = ™ ean gy? 


= mean {a x mean y” for given a} 


= mean |e" xX mean (= “+2 v) for given x 
P 


) v-1 
= mean {#2 X mean (P52 a ap +9 ea gery 
Poo” P20 
v.(v—-1 v-2 ; 
+ ‘ 5 Le gy + | for given a}, 
G 20 


and so remembering that mean y’, mean y’, etc. vanish, and mean y4, mean y, ete. 
reduce by the above formulae we have 


i\" v(v—1) /p 
Pw = ie Putoo + _ 23 (Be Pa Puty-20 


P20 
v(u—1) (uv — 2) (0 — 8) fpu\" Sik “| 
a 1.23324 ( = Puctv-s0 (Pe - Poo 
v! 11 TG r Pu® : 7 45) 
Samacaal Ps) Diesen. eo (pu -) See a ee (Cx) 


It follows that if w+ v is odd p,, is zero, that is 
P30 = Pa = Pie = Pos = 0, 
Ps = Par = Pe = Pos = Pris = Pos = 0. 
If w+v is even it is convenient to divide by suitable powers of py and p., and 


iS exhibit all the reduction formulae together as follows: 


¥/; Dep Doe Doo 
PolPa? = PulPo? =3, Pa/Poo® Pa? = Prs|Pao* Poa! = =e, Px[ PPro = 1 + 2p?, 
Deo! Pao =i pe = 15, Poi[ Poo! Poa! = Pis/ Pao? Dos! = 15p, 

Ps2/ Poo” Por = Pos/'P20 Poo = 8 + 12p?, Pss/Poo® Post Ot AO Os meee: a easeente (Xxx). 


putting p for 


If in like manner the numerator and denominator of the expression (xvii) 
for the deviation in r in terms of the deviations in the moments be divided by 


NV Doo Doo and we write 


Apr Apo dp poo dpu 
y= =}, OG a) A=, oo ? = 
V Doo Px» é V Doo Poe V Don V Doe 
it becomes f+dr= oe ie re een SEALY) 


V1 + a, — 0”) x V1 + 8, — 87) 
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When this is expanded, the mean values of a, 8;, a, 8., and y are of course 
zero. In the following tables the general formulae (xxviii), (xxix) and (xxxi) for 
the mean products two, three and four together of deviations of moments in 
samples of n are given at the head, the suffixes used and their composition in the 
several formulae are shown in the initial columns (omitting repetitions of dp and p 
to abbreviate the printed matter), and the resulting formulae for the mean products 
of the a’s, @’s and y’s required in the expansion of (xxxiv) are given in the last 
column, the reductions of the higher moments having been made by (xxxiii) to 
suit the case we are investigating, that of a normal distribution of the two 
characters in the material sampled. Since the first and second moments only of r 
are at present sought it is unnecessary to take products involving higher powers of 
y than the second. The four additional formulae to y* are inserted, however, to 
complete the formulae for the third and fourth moments when required. 


As an illustration, if in equation (xxix) we put 


u=0, v=2, w=l, v=l, w=1lpa’=7 
i 
we get mean dpodpudpyr = ne [ Pos — PisPu — PisPu — P22 Poe + 2Diapaule 


Dividing by psp” and using (xxxiil) it follows that 


mean {37 = [(3 + 12p?) — 3p? — 3p? — (1 + 2p”) + 2p?) 


nv 
1 

== 2 2 
aL + 6p"), 


the penultimate formula in table (xxxvi). It will be seen that the values to be 
attributed to the terms signified by the suffixes between the double rules are the 
right-hand sides of (xxxiii), the composition of the terms being shown in the last 
column of the formula. For 24 put 3+12p°, for 13 put 3p, for 11 put p, for 22 put 
1 + 2p?, for 02 put 1, and the formula for mean 8,7? may be written down without 
any division being necessary. 


The formulae (xxxvii) are derived from those in (xxxv) and the suffixes between 
the double rules are for reference to the first columns of the latter table. Thus in 
the penultimate formula look up 02 11 in (xxxv) and find 2p, Look up 11 11 and 
find 1+p%. 2px (1+p*)=2p + 2p%, the first of the three component terms added 
together in the last column of the formula. 
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mean pir . pyr! ome n Darra! ote’ Div Dule'| 
—|| Mean value | af x 
Uv Cea | as @ uv u'v’ Of 
V+v 
10 10 20 10 10 ay” 1 
Ol 01 02 01 Ol B? 1 
10 20 30 10 20 aya O 
01 02 03 01 02 | BBs 0 
20 20 || 40 20 20 | ay? | 2 
02 02 04 02 02 Bo? | 2 
10 Ol 11 10 O01 a8, p 
20 02 || 22 20 02 ay32 | 2p? 
10 02 12 10 02 ay Bz 0 
20 | Ol 21 20 | Ol a | O 
10 11 21 10 11 ayy | O 
Ol 11 12 Ol 11 Bry 0) 
20 11 31 20 11 ayy 2p 
Oy i) ai 13) 02) \ Fil Boy | = &p 
ila 11 22, IL 11 y i chee Nl Res ooe (xxxv) 
mean Ap C apy,’ c Ayn’ 
1 2 
= ne Beret otto” ~ Puta vty Pu! — Putul ote" Pu ~ Pu tu’ v'+0" Puw oF 2Duw Pu’ vv’ | 
~ - - +2x 
= = Mean value 1 
, "” , ’ Tena, spi 
uv |u’v’ | wy” . i i ot is 5 v | wy" oe) u'v’ = es a uv | uv | wo! | wv’ ef Be 
10 | 10 | 20 40 20 20 30 10 30 10 | 10] 10 | 20 || ay7ay 2 
Ol | O1 | 02 04 02 02 03 Ol 03 01 | 01 | 01 | 02 || BB: 2 
20 | 20 | 20 60 40 20 40 20 40 20 | 20 | 20 | 20 ay? 8 
02 | 02 | O02 06 04 02 O04 02 04 02 | 02 | 02 | 02 2° 8 
10 | 20 | O1 31 30 Ol 11 20 21 10 | 10 | 20 | O1 ajay, 2p 
10 | O1 | 02 13 11 02 12 Ol 03 10 | 10} O1 | 02 a1) Be, 2p 
10 | 10 | 02 | 22 20 02 1) 110) 12 10 | 10] 10 | 02 || ay"By 2p? 
20} 01 | O1 | 22 21 OL 2101 02 20 | 20} O1 |} O1 ahi" 2p? 
20 | 20 | 02 42 40 02 22 20 22 20 | 20 | 20] 02 ay"By 8p? 
20 | 02 | 02 24 22 02 22 02 04 20 | 20 | 02 | 02 ay[39” 8p? 
10 | 10 | IL 31 20 11 21 10 21 10} 10) 10)} 11 a,"y 2p 
Ol | O01} 11 13 02 11 12 O01 12 OL OL) O19 ad Bi’y 2p 
10 | Ol | 11 22, 11 11 21 Ol 12 LOT LOR FOR rt aByy 1+ ? 
20 | 02} 11 33 22 11 31 02 13 20 | 20 | 02) 11 ayByy 4p + 4p* 
20 | 20 | 11 51 40 11 31 20 31 20 | 20 | 20] I1 ay”y 8p 
02 | 02} 11 15 04 11 13 | 02 13 02 | 02 | 02 | 11 Bo*y 8p 
20} 11 | 11 42 31 11 31 11 22 20 |} 20} 11} Il ayy” 2+ 6p? 
02) j- 11, |) 11 24 13 11 13 11 22 02 | 02] 11 | 11 Boy” 2+ 6p? 
Ty ane ea 33 22 11 22 11 22 10) WO aie ala y 6p + 2p3 
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mean Ap,.dpyy«Apyry Apyry 
=mean dp,,.dp,y X mean dp, .dpyry” 
+mean dp,,,.dpy" X mean dpyy.dpyry 
+mean dp,,,.dpyry X mean dp,y.dpyry. 


7) 


= | Mean 1 

Laat Whaatt Wt tt PA Matt Watt Matt | taal My It Uwe Ld pelade value of ~ ne 
ww |wv | uv Uv Uv | UV uv Wwev WW Uv | wv uv UW UU UWwv Uv | 
= | | a | 
10 | 10} 10 10 | 10 | 10 | 10 10 10} 10 | 10 10 10 10 10 | 10 a! 3 
Ol | Ol] O1 Ol 01 |; Ol] O1 OLB Ol | Ore TO OL OL Ol 01s oll By 3 
10 | 10 | 20 20 10 | 10 | 20 | 20 | 10} 20 | 10) 20 | 10) 920 | 10) 20 aya»? 2 
Ol | OL | 02 | O02 || O1 | O1 | 02 | 02 | O1 } 02 | O11 02 | OL) 02) Orr) 02 By?Be? 2 
20 | 20 | 20 20 20 | 20 | 20 20 | 20| 20 | 20} 20 | 20} 20 | 20) 20 || a, 12 
02 | 02 | 02 | 02 || 02] 02) 02 | 02 | 02] 02 | 02) 02 | 02+-02 |-02)| 02-4) eA 12 
10 | 10; 10 Ol 10 10} 10 01 LON SOs LOR SO 10]; Ol 10; 10 a,°By 3p 
10 | 01 | OF }°0L. || 10) OL | O1 || OL 4) 104 On 01.) SO 104) “Ole nore xon 18,3 3p 
10°) 10° | OL | OF | 10) 10 | Ol 7) OL 105) Or 108) Ol 10 Ol sialon mor ay2B2 1+2p? 
10 | 10} 02 02 10 | 10] 02 02 10; 02 | 10; 02 10] 02 10 | 02 ay"Bo" 2 
901-20 | OL | O01 || 20'| 20) OF | OL ~) 20%)"Or | 20) -01 | .20°) sore roo) sor ay? 2 
20 | 20] 20 | 02 | 20] 20} 20 | 02 | 20] 20 | 20] 02 | 20] 02 | 20} 20 ay?Bo 12p? 
20 | 02 | 02 | 02 | 20) 02} 02 | 02 | 20} 02 | 02} oO2 | 20] oO2 | 02] 02 ayo 12? 
20 | 20 | 02 02 20 | 20 | 02 02 | 20) 02 | 20) C2 | 20) 02 | 20) 02 ag? Bo? 44 8p 
10 | 20} 20 | O1 10 | 20 |"20 | 01 |/10 | 20 | 20} 01 | 10)) ‘or 207/820 aya2B; 2p 
10 | O1 | 02 02 10 | O1 | 02 02 10) | 9025) O19) 702; WP LO) | 02> | .Ols 02 a8. 2p 
10 | 10} 20 02 | 10 | 10 | 20 02 | 10] 20 | 10) 02 10} 02 | 10 | 20 ay7arBo 2p? 
90 | 01 | 01 | 02 |] 20] 01 | O1 | 02 | 20} O1 | O1| 02 -| 20] O2 | O1} O1 |] «B26, 2p? 
10 |-01 | 20 | o2 4| 10 | 01 |: 20 | 02 | 10} 20 | 01} 02 | 10|--02 | O1 | 20 1) aye\as@; 23 
10 |:10,]: 20.) 11 | 10°) 10} 20°) A 10} 20 |} Los) Wy) 100 ah 10) 2Ona aiancarr 2p 
O1 | 01 | 02 11 O01 | 01 | 02 VE) Ot 902 9201) LL AOR LE Ors 02 ByBoy 2p 
20 | 20 | 20 11 20 | 20 | 20 11 20 | 20 | 20) 11 20} Il 20 | 20 ayy 12p 
02 | 02 | 02 11 02 | 02 | 02 | 11 02) 02 | 02) IL 02 11 | 02 | 02 Boe y 120 
10 | 20} O1 11 109-205 | S01 ut 10) O1 | 20} -11 TOM rela 20 | Ol aya,Byy 2p? 
10 \O14) 025 11 10 | Ol | ©2 | IL | 10) 02°) OL) It | 10) 11 || OL Oo a eareiers, 2p2 
10 | 10 | 02 ll LOM}, LO) | 02 11 10 | 02 | 10 11 10 11 10 | 02 ay"Boy 2p 
20 | O1 | OL 11 20 | 01 | Ol 11 | 20) O1 | OLY] 11 | 20) 11 | 01 | (OT il vaseiey 2p 
30 1:20 | 02) 11 || 20) 20°) 02) die 120712025)" 20") 119207) ie eos ao ay’Boy 4p + 8p? 
20 | 02 | 02 iil 20 | 02 | 02 11 | 20] 02 | 02 11 20) AI O23 102 ay39"7 4p + 8p 
10; 10) 11 11 LOM LOs) Pin al LOD ss LO 11 10 11 10} 11 ayy" 1+)? 
O1 | O01} 11 11 jj O01} O01) Ol 11 Oe es oll 11 Ol ll Ol | 11 By? 1+p? 
10} Ol} 11 ll 10 | Ol} 11 11 LON Vi Ol 10 11 CO) aki a,Byy" p+p> 
90 1,02 | 11-| 12- || 20.) 02) 11 ) 11 | 205) Tl | 02) |) Ta )o0e) ie 02) mikes anaes. 10p?+ 2p! 
20 |. 20) 11 11 20.) 20 | 11 11 20 | 11 | 20 11 207) 20} 11 || agty 2+ 10p? 
02 | 02} 11 11 02 | 02} 11 11 02} 11 | 02 11 02 TUL 02) 11 Bo?y* 2+10p? 
20} 11} 11 LAE 200) Th 1 11 PAO) || IR) ale jhe aL 20 11 HIE | aE ayy? 6p + 6p? 
O2 |) 0 fs PE De OR) AE TL ye O28 I eT AORN e Tete 1 Sie aati Bey’ 6p + 6p? 
TL iy war | a 11 HE) Ee a 11 JIT IE AL yh) SL ie aL et ll} il y3 2+4p?+2p4 

| 
sbeaiived Auceemeede (Xxxvil). 
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We are now in a position to expand (xxxiv) to terms of the fourth degree in 
a, 8 and y, take mean values of the a, 8, y products as found for samples of normal 
distributions and so obtain 7 the mean value of the correlation coefficient in 
samples of n of such distributions correct to terms in 1/n?. 

r+dr 
=(0 +7 — % 8) 
x (1 — $a, + pay? + $a." — fa, a, — ean? + 2a, bene ay” ee 
x (1-48. +387 + 3B? — 3B%B.— geRo? + 3B + 18 BB. + BB 
=p-4{p (a. + 8) — 2y} 
+ £ (4p (a? + BY) + 8p (a? + 82) + 292.8, — 4 (Gary + Bay) — 8,8;} 
+ ay {— 12 (aPa, + B22) — 5p (as? + B®) — 4p (028, + % 8,2) — 8p (a.2B, + 8.°) 
+8 (apy + Bey) + 6 (a2y + Bey) + 4a Boy + 8 (a,%8, + % 71 B.)} 
ae {48p (at + Bit) + 120p (a,2a, + 8,78.) + 35p (a4 + B.') 
8p (Aa Bs + 8,28.) + 20p (a3 Bo + 428°) + 82pa2B,? + 24p (4,28. + 2,7B,") 
+ 18p2,°8.? 
— 96 (@Pory + BY Bory) —40 (ay + Bey)— 32 (Bary + A8,?7) — 24 (42° Bory + O28?) 
— 64 (a3, + a, 8,3) — 48 (04,28; + 4, 8,82?) — 82a, 8, 0,82}... eee. (XXXvlil). 


Whence taking mean values in samples of n of a normal distribution, 
if 
T=p+— (8p + 12p + 4p? — 16p — 8p} 


ele 48p — 80p — 16p? — 48p° 


+ 32p + 96p + 16p + 16p? + 32p} 


- {288p + 480p + 840p 


+ 192p? + 480p? + 32p + 64p? + 96p + 72p + 144° 
— 8384p — 960p — 128p — 192p — 384° 
— 8384p — 192 — 64") 


1 
+ 128n? 


1 : 
= p— 3 p(1— pt) — > p (I~ p') (1 + 3p") g sigpawtnnnon tai. (XXX1Xx). 


Or, expressing the result in terms of n—1 (by changing n into n’+1 and 
expanding) we may write to the same degree of approximation 


eel ae 1 — 9p? 
r=) E Oe 51 AG oF ce ca eaieatne cea (xl), 


from which follows that 


ee Ch p)| 1+, Ho11-se— | Pe ieee (xli). 
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And, again, by squaring (xxxiv) and expanding we obtain in the same way 
r+ 2rdr+(dry 


= (p + 2py +? — 2pa, By — 2a, Bry -+ 71°B,’) 


x (1 = ay t+ a? + a? — 2a?a, — a? + at + Bay?a,? + a") 
x (a _ Be am Be ale Be a 28,°B. = Be a0 Bit ar 3BYBY cis B2') 
=p? + {2py — p°a — p*Bs} 
+ {ry — 2p (a, B, + Gory + Boy) + p? (a2 + Be + a2? + Bo? + a8o)} 
ar {- 2a, Biy — Aor” — Bory? + 2p (0, 8; + 0); By + ay? + Boy + 07 + Boy + a, Bor) 
+ p? (— 2a,%0, — 28; Be — a° — Ba? — a,” Bz — A, 8,? — 4.28, — a, 8.”)} 
= {a8 + 2a ai + 2a, 8, Boy + O24? + Bey Sits act Bory? ar a, Boy” 
+ 20 (— 438, — 4B — &) 0? By — OB, Bo? — 081% Bo 
— 20,? ay — 2B, Boy — aby — Boy — Boy — A, ByPry — A? Bory — Ay Boy) 
+ p? (a4 + By! + 38a,?a,? + 38) Bo + a! + Bot + 24,2 8, + 20,8,?8, 
+4°8,+4,8. + 4°82 +02 82+ a7 6. + a2 BP)} .c.cce dase ncs-eeeneeeen 
And taking mean values in samples of » of a normal distribution, 
7’ + mean (dr)? 
2 1 D} 9 9 
=p +> [1+ p?— 2p (p + 4p) + p'(2+4 + 2p*)| 
il 
+ 73(— 2 — 2p?— 4— 12p* + 2p (4p + 4p + 16p + 4p + 4p) 
+ p?(— 8 — 16 — 4p? — 16p*)} 
1 
+ ara + 2p? + 8p? + 2+ 2p? + 4 4+ 20p? + 10p? + 2p! 
+ 2p (— 6p — 4p — 2p? — 8p — 24 — 4p — 8p — 16p?) 
+ p?(6 + 12424 + 8p? + 2407+ 1 + 2p? + 4+ 8p*-+ 4)} 
1 
=p? += {1 — 8p* + 2p%} 
1 : 
+ 2-6 + 18p? — 12p%} 
1 
9 1 ) 9D 2 al 2 2 4 
=p? + (1 — p*) (1 — 2p*) + = (1 — p?) (1 + 4p? — 8%). 


And by squaring (xxxix), 


eee. te galtare 2 : 
P= pi—— pil —p’)— aap (lL — p*) (1 + 5p"). 
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Hence by subtraction, 
1 
mean (dr)? = “ (1 — p?)??+ an (1 — p’)? (1 + 5d p?). 


Or taking the square root 


15 (144+ e) 


op, = 


Vn 2n 
which may be expressed in like manner with 7 in the form 
1 = p? | l1p? a 
(3 || Uy ae a lonecacaneoneaccotn dca xliil), 
=e 


to the same degree of approximation. 

It appears then from the above results that if the coefficient of correlation 
existing between two measured characters in a large aggregate of individuals be 
computed from the product moment values in small samples, these values are 
subject to errors from a mean value, the standard deviation of which errors may 
be very approximately represented by the formula 

est 
Vvn—1" 


and with greater degree of accuracy by the formula 


l-p Ii? 
ee ee i 
74 ( 4n 


p being the coefficient of correlation between the characters in the material 
sampled and n being the number in the sample. 


Moreover the mean value of the correlation coefficients obtained from such 
small samples will be less than the true coefficient of the aggregate and will be 
approximately represented by the formula 


1— p? 
(eee 
p( 2Qn i) 


the defect being very small when p is large, and when p is small being of the order 
5°/, in samples of 10 and ‘5 °/, in samples of 100. 


On the other hand the modal value of the correlation coefficients, or the most 
likely value in a single sample, will be greater than the true correlation coefficient 
(that is to say numerically greater: the correlation being supposed to be measured 
positively). 

We have, by definition re i _ 


a 
and so from equations (x1), (xli1) putting n — 1=n’ 


1 — p” Wye p° —I 
iG ae es 
[> nv’ (1 a3 4n/’ ) (1 a3 =| 


using second approximations. 
14—2 
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And hence from (x) going now to third approximations, 
Be 1 = 1/r 
Se fea § 
L=p*, (L=p*)(1 ~ 9p) 
p {1 ~ ue Sn? 


x {1 es Lini sew ee) “F' 


I 


| 3 =) _ = 04) 27p"\> 
nv Qn? 


= fe {1 A! sate) (41+ 23p°) (1 — p?)) ; 


Qn’ Sn? 


The excess of # over true p is zero if p=0 and if p=1, but if nm is small and p 
fairly large the excess may be such as to make the modal value unity or greater 
than unity. If for instance n is so small as 4, n’ being thus 3, the above 
approximate equation gives 


F=p+hp (1—p*) + (hp + 2p") (1 — 6’) 


or F = 2:069p — “750p? — °319p° 
= *93 when p='5 
=105 , p=% 
= LA Se Oia 


The frequency distribution in the last two cases is of the J type, there being 
no mode within the range. The greatest frequency is at the extremity of the range, 
or at value unity. The interpretation of this result is clearly that such small 
samples as 3, 4 or 5, as might be expected, fail altogether to give by the product 
moment formula an approximation to the correlation coefficient. Under some 
circumstances the points which graphically represent the observed measures are 
more likely to be in a line than to have a configuration represented by any 
specified fractional correlation coefficient. This will happen if the correlation in 
the material has a larger coefficient than ‘6 (approx.) when samples of four are 
drawn: ora larger coefficient than ‘3 (approx.) when samples of three are drawn. If 
samples of two are drawn the coefficient of correlation is necessarily unity in the 
sample whatever it may be in the material*. All the distribution is concentrated 
at value unity and *# should in this case be infinite for all values of p. Our 
approximation, neglecting terms in 1/n* etc., cannot of course show this if n’=1. 
It gives * greater than unity and so a J type, but fails to show the complete 
concentration at unity, 


It appears from (x) that * will be infinite when X=3 and F any value other 
than zero, whilst * will be zero if 7 is zero and » other than 3. If 7, and therefore 


* Supposing the material ungrouped. If it is grouped some values will be indeterminate in small 
samples, viz. when all observations fall into the same group. 
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p, is zero and »=8, 7 is indeterminate. This case is a little singular and (ix) 
shows that m, and m, are zero and that the frequency is therefore constant or 
that all values of 7 are equally likely, from —1 to +1. Remembering that 


z= ae it will happen when o,?=4. But when p=0 the value we have found 


= 


for o,? is = which will be 4 when n=4. If therefore from material possessing 
nN — 


zero correlation samples of four are drawn, the values of the correlation coefficients 
in these samples should, if the above formulae and assumed type of frequency 
distribution are correct, be equally distributed in value throughout the whole 
range. 

Complete experimental confirmation of the distributions found in this paper 
for the product moment expression of the coefficient of correlation in small samples 
is difficult to obtain. The second order differences in the values of the mean and 
standard deviation of the distribution are necessarily small in large samples and 
comparable with the errors of sampling of such samples unless a very large number 
of samples is taken. On the other hand if small samples are taken the theory is 
not so exact and in addition the distribution tends to concentrate in certain 
grades if the original material is coarsely grouped, whilst fine grouping adds very 
much to the labour of computation. 


In a paper published in Biometrika, Vol. v1, p. 302, “Student” gives the 
results of a very painstaking investigation to determine experimentally these 
distributions. Material based on W. R. Macdonell’s measures of finger length and 
stature in 3000 criminals [Biometrika, Vol. L., p. 219] was formed having correla- 
tion with coefficient* ‘66 and samples of 4, 8 and 30 were drawn. Material with 
correlation zero was also constructed and samples of 4 and 8 drawn. 


The comparison of the actual means and standard deviations found in these 
experiments with those computed from the formulae in this paper is shown in the 
table that follows. 


1 2 8 J 5 

| a = 7 a | 
Number in sample, x . A 8 4 8 30 
Correlation coefficient in n material, a 0 ) 66 6666 


Mean correlation coefficient in samples, observed we | — | 5609 °6139 -661 
% calculated,7 ... | 0 0 | 5933 | -6317 | °6542 
Standard deviation of coefficient in samples, obser ved «.. | *D512 | <3732 || -4680 Spe |: 1001 
+3 calculated, a, | °5773 | °3780 | +4234 | +2453 | -1088 
Number of samples taken... ads ob ee ... | 745 | 750 | 745 750 | 100 | 
Probable error of mean = fies ome ... | 0143 | -0093 | -0073 | -0105 | 0060 | 
4) » standard deviation | me si ... | O101 | -0066 | ‘0052 | -0074 | -0043 | 


* The exact figure was °6608. 
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It will be seen that the ditferences of observed and calculated values are for 
the most part several times the probable errors. Estimated in this way case (3) 
p= 66, n=4, is the worst fit, the difference of the mean being five times and that 
of the standard deviation nine times the probable errors. 


From the values of 7 and oa, found above are calculated 


i 2 3 J 5 
eg as | 7 | 36140 | 9°9905 | 48-325 
oO," 
lal 0 es 
faa T= |- G 0 | 2°5259 8124 6831 


The frequency distributions of Type I to fit the numbers of samples taken in 
the experiments and the values of 7 and a, calculated will be of the form 


y= (1 —aym (1+ am 


when referred to the absolute origin and unit of measurement of r, and the 


constants* will be 


1 Qi) 83 i 5 
| = | Ne 
PPG cen gery = = ace 2 ~-46844 | 6557] 71825 | 
Ng=} (A— eee = | 0) 2 | 1:08243 | 6°3348 | 38°1425 | 
wi T (my + iz +2) SHE | | | 
(oe av*) _ _ | 372-5. | 703-12 | 202-25 1: 003: 
et ces oea aera Fae 372-5 | 703 02°25 95-131 0033889 


When the above frequency curves are plotted they appear as shown on the 
diagrams pp. 112, 115 and are seen to be in fair consonance with the frequencies 
observed in the experiments and shown by the rectangles upon the same diagrams. 
They are perhaps as good an expression of these frequencies as could be found 
amongst the type of frequency curve assumed. The case of p=0, n=4, for which 
theory prescribes a horizontal straight line is seen to be very nearly so in the 
experiment, apart from individual fluctuations. In p=0,n=8, the curve well fits 
the deviations from zero correlation observed. In p=°66, n=4, the asymptotic 
nature of the distribution towards the value unity which the fitted curve fore- 
shadows is borne out in the samples of four drawn. With larger samples from the 
same class of material the displacement of the mode and the skewness of the 
distribution resulting from the assumed types are corroborated in the tests. 

At the same time it must be admitted that there are considerable differences 

* For the special case of p=0, eqns. (ix), (xl) and (xliii) show that the curve is y=ypy (1 — 92/8 (m4), 
the form suggested for this case by “ Student,” Biometrika, Vol. v1. p. 306. 
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needing to be accounted for. The individual irregularities in the observations are 
more than would be expected in random samples of homogeneous material and it 
is probable that these jumps which make a good fit of any continuous curve what- 
ever an impossibility are partly due to grouping. 
material were too coarse there would be a tendency in small samples for statistical 
constants to centre round certain values. Another possible source of error is in 
the mixing and in the drawing out of samples. Although a great deal of trouble 
was undoubtedly taken in these experiments, yet there always seems room for a 


If the grouping of the original 


little involuntary order in repetitions intended to go solely by chance. 


The curves were planimetered and the following tables show the comparison of 
theoretical with observed frequencies of the grade values. It will be seen that 
the differences are not systematic but that the + and — errors are fairly mixed. 
On calculating the square contingency, y?, and deducing from this and the number 
of groups, n’, the probability, P, of these differences being purely that of sampling* 
such probability comes out in most cases very small. It seems legitimate in this 
instance to see to what extent grouping will smooth down the irregularities and 
yet show the general resemblance and, with this in view, the differences in the 
columns headed e’ are calculated and the grouping therein shown may be taken to 
indicate what is necessary to bring the probabilities within reasonable distance of 
expectation. 


n=20, y?=2093, P='341. 


p=9, n= A, p= 90, n= 8. 
ip aes rere Difference e eae | Observed Difference 2 
m quency e n - frequency e rs 
*925—1°0 27°94 22°5 — 5°44 1:06 6 — - 6 “60 
*825— 37°25 31:5 — 5°75 “89 4°] 3 - 11 30 
*725— 37°25 24-0 — 13°25 4°71 11°6 12 + <4 ‘01 
*625— 37°25 35°0 — 2°95 14 20°6 TIES -— 91 4:02 
*525— 3 34:0 — 3°25 28 31°9 28°5 — 3-4 36 
*A25— ' 47°0 + 9°75 2°55 493 46 | + 3°7 "By 
*325— " 30°5 — 6°75 1°22 51°6 47°5 - 4:1 33 
995-— ‘= 46°5 + 9°25 2°30 59°9 70 | +10°1 1°70 
*125— 5} 44:0 + 6°75 1°22 65°8 57°5 — 83 1:05 
_°025— + 32°0 — 5°25 “74 69°4 70 boar 36 ‘O1 
1:925— 5 45:0 + 7°75 1°61 70°3 60° | — 9'8 1°37 
1:825— 5 43-0 + 5°75 “89 67°9 aly |e Bs ‘19 
1°725— 5 41-0 + 3°75 38 63:1 76 +12°9 2°64. 
1°625— 5 37°0 — ‘25 “00 56°3 63 + 6°7 80 
1°525— a 44-0 + 6°75 1°22 471 42 = "5" 45}5) 
1:425— “ 40°0 + 2°75 20 36°9 33 -— 39 “Al 
1:325— ; 38°5 + 1:25 04. 26°1 29 + 2:9 "32 
1:225— re 32°5 — 4:75 ‘61 it} 333 20 + 4:7 1°44 
_1:125— ; 36:0 — 1°25 04 eA. 8 + 6 05 
J—1°125 46°56 41°0 — 5°56 83 1°8 1 - 8 36 
745 20°93 750 16°83 


i —20ey — 1683, ee — GO 
* Tables for testing the goodness of fit, W. Palin Elderton, Biometrika, Vol. 1. p. 155. 


Frequencies per ‘1. 


Frequencies per ‘1. 
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Fic. 1. 


Olbservect anc caluclate cdéslribution 
of correlation coefficterts (rr samples of 
for from 2 JOSsEessing , rero 
correlation Celweer the ltvo Characlexs. 


745 samples 5 


Values of r. 


Fic. 3. 


Observect anc calculate cléstr<lution 
of correlation coefficients trv samples of 
four from muatertal fjrossessérrg conmrelation 
66 


coeffectemt 


745 SL rrples 


7alues of r. 


ale 


Frequencies per 


Frequencies per ‘1. 
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Fic. 2. 


Olbsexvect & caloulatecl dislribhut® om 
of <ox oes SOUSA rls €7e SL 

ChE: POT 7rIce 7 SSCSSE 
sees ee relation rr peas the 7 
Gwo macle ms. 


750 samples. 


Values of ¥. 
Fies. 4 anp 5. 
150 <i > 
140 
130 Oper eclpee se wece Con letleclelea ta Cie rare 


of corrmclatio écterns Cre samples o 
snc Srome Truectlniail JOPSESSERG deen x ae 


750 samples 


loo scun ples. 


10 
0 
-10-9 -8 -7 -6 -6 -4 -3 -2 -1 0 4 


Values of r. 
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114. On the Probable Error of the Correlation Coefficient 
p= 66, n=4. 
: Calculated | Observed Difference e Difference 2 
frequency m| frequency e mv ée m™ 
905—1-0 230°3 175°5 : ; Vic wierd : 
-805— 98-9 136°5 eu MO Ne tee ey 
*705— 72°1 84 £ 2. on: di 
Res tie es ! +203 | 318 +203 | 3-18 
505 — 48°0 55 : 
-405— 40°2 45 Teeth ets J 2-0 03 
305— 34:3 24°5 ) : aoe. 
205 — 29°7 24°5 (ere sal) eed 
105 — 25°6 19 ; 
005— 220 7 SE ee ee 
T-905— 18°8 22 ; Bi 
1805— 16-0 12 ay Sa aie 
1°705— 13°5 13 ne a 
1°605— 11-2 3 Mig eae 5 iyi 16 
1°505— 9-0 12 : ; 
1-405— 6-9 16 {+12 eritgiee. 
1°305— 51 7 ‘ c 
1-205— 3:3 10 Fi ON Sco 
1°105— 19 4 K : 
1—1 ‘105 6 9 + 10°5 44:10 
- 745 84:17 4-27 
m=10, y?=8417, P very small. 
wW=4, x2= 427, P=-287. 
p='66, n=8. 
: 
; Calculated | Observed Difference e Difference e”2 
d frequency m frequency e “m e’ ™m 
925—1:°0 48°9 37°5 -11°4 2°67 e 
825 — 127°6 126 a8 02 \ ee ED 
*725— 135°25 137 + 16 02 + 16 ‘02 
*625— 120-0 125°5 + 5:4 24 + 5°4 "24 
*525— 97°25 105°5 + 8:1 68 + 8:1 ‘68 
495— 73°9 80°5 + 65 58 Beas cts) 
*325— 53°5 39 —14°6 3°96 
*225— 37°0 37 0:0 00 
125— 24°25 16 — 83 2°82 
_°025— 14°75 14 - 8 04 
1°925— 85 9 + 5 03 
1:825— 4°75 7 + 2:2 1:05 + 6:2 “Al 
1°725— 2:25 9 4+ 6°75 | 20°25 | | 
1°625— 9 3 + 2:1 5°22 
1°525— °25 2 + 1°75 12°24 
1°425— 10 2 + 1:9 36°10 
1°325— 
1-225 — 
_T125— 
1—1°125 
750 85:92 2°85 
n=16, y?=85:92, P very small. 
1 — Oy — 2 Spee a oe 
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p= 66, w= 30: 

fy Calculated | Observed Difference _ e 

d frequency m | frequency e m 
*795— 8°5 6 —2°5 “74 
*735— 15°8 17 +1°2 ‘09 
“675— 21°6 27 +5°4 1°35 
*615— 20°9 22 +1°1 06 
555— 15°8 14 alis6) ‘21 
_495— 958 10 + 2 04 
1—*495 76 4 —3°6 171 
100 4:20 


n=T, 77=420, P=-650. 


It is hoped that further experiments may be shortly carried out which will 
have regard to the points raised and show definitely whether the distributions 
theoretically arrived at in this paper are good presentations of fact and whether 
the application of the standard types of frequency curves to the distributions of 
statistical constants in small samples is justified. 


I am indebted to Professor Pearson for drafting the lines of this investigation 


and for critical supervision. 


ON THE MEASUREMENT OF THE INFLUENCE OF 
‘BROAD CATEGORIES” ON CORRELATION. 


By KARL PEARSON, F.RBS. 


(1) Bya “broad category” I understand one of a finite small number of groups 
into which we class a variable. For example: we may divide General Health into the 
categories Very Robust, Robust, Normally Healthy, Rather Delicate, Delicate and 
Very Delicate. These categories may be verbally defined or have their boundaries 
determined by quantitative limits as when we state that the limits of the Delicate 
coincide with so many weeks of sickness or of absence from work in the year. 
Again we may put into four to six classes the competitors in an examination, and 
the boundaries to these classes may be really percentages of marks gained. Such 
broad categories are very common not only in social investigations, but also in 
psychological records, and quite recently Dr G. A. Jaederholm, a Swedish 
psychologist, wrote to me asking what was the correlation between the true 
quantitative value of a variate in any individual and that individual’s category 
or class-mark. The answer is an obvious one, but I do not know that I have 
seen it stated or any discussion of it given. It of course assumes that at the back 
of the categorical classification a true quantitative value lies. 


Suppose a population of NV individuals divided into p classes, and that C, is 
the class-mark of an individual in the sth category, whose true variate is a. 
The problem is what is the correlation of # and the class-mark. Let %, be the 
mean variate of the group of n, individuals who fall into the sth class. Then it 
is Just as reasonable to call z, the class-mark as C,, for given one the other is fixed. 
We really want then the correlation of z and &,. 


We may either find this directly* or indirectly, and the latter is the easier 
course. Clearly if # be the mean value of w for a given class, then 


or L-xe= 


* Let « within the class=%,+.2,’, S=sum for classes and Y=sum within class. Then 
SZ (n,cX,) = SE {n,F, (T+ L,')} =S (n,X,2) + S {ZZ (ny x,')}, 
but = (n,@,/) =0, 
S= (ne %,) =8 (7,27), 
_ SE (n,vF,)  S(n,F2)_ 


Ee Fx THyTx 


Paes 


Sin iF 
N.0%,X0, No 


=0;,/0, as before. 
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is the equation to the regression line ; therefore 


Whence it follows that 


or the correlation of a variate with its class-mark is the ratio of the standard 
deviation of the means of class-marks to the standard deviation of the variate. 


It follows from this that in classifying a variable into broad categories the test 
of their efficiency, as far as number and arrangement are concerned, lies in the 
standard deviation of their means not differing widely from the standard deviation 
of the variate itself. 


(2) Clearly Giar cactie CORI) | BL Re reere racer eee ere (11), 
where S is a sum involving all classes. 


If before classifying into broad categories we have made a quantitative 
determination of these classes on a sample frequency, the values of #,’s can be 
determined. If this has not been done, or cannot be done, we are bound to 
assume some form of frequency distribution. Suppose we take a normal distribu- 
tion, then we know that 


12? 
N fts —=z-, 
nL, = —— re 20°dr= No, (25, = Ley 
Var oy! %s, ° 
Lend (ele,)? 
where ge nigee AE 
Ca 


and can be found from Sheppard’s Tables of the ordinates of the normal curve 
as soon as n;, etc. are known, for the z’s are the ordinates at start and finish of the 
sth class, reduced by the factor N/c,. 


Hence 


N = : 
o,= S i (25, — 2.) X O27} , 


or TeC= vy, S iz (25, — 2s, 4 dei iawsca OMe isccena eeu: (111). 


Thus r,¢, can be found at once from Sheppard’s Tables, when the totals of the 


broad classes are known. 


(3) Let us now suppose a second variate y and assume that the correlation 
of « and y for practical purposes is linear. Then clearly since a given « will have 
a constant class-mark, the correlation of y and C,, for a constant a is zero; that is 
to say that the partial correlation coefficient 


p "yc, — Tay Tx, 
x C : oa === es = 
See Ny = rey V1 —??, C, 
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Hence Ty C, — Tay Txc,= 9, 
7 
yC g 
or ea es TPB Neds GSAS ween s eee (iv). 
TOF 


In other words, to find the true correlation of # and any other variate y, divide 
the correlation of y and the class-mark of «, by the correlation of # with its class- 
mark. 

From (iv) we can deduce the correction for number of arrays when we find zy 
the correlation ratio of w and y on the supposition that r,, equals sufficiently 
closely my, 1.e. when we “find 7 by methods.” Let H,g, equal the value of 
found when y is finely classified and the mean of an array can be determined for y, 
but the arrays of x are broad classes*. Then 


Jaf Val 
Nay = y Cr = a eat Be ee = pHnDonUdoUnDOdovOaDOODGC (v), 
Tro, VS (n,a)|(No,’) 
Jak 
= Le a eo gnc: (v1), 


Ne 2 
V8, Gn Fa 


if we suppose the distribution normal. 


(4) Now let us consider two variables w and y given by their class-marks C, 
and C,. If we correlate y with any variable w we have at once by (iv) 


ro ba 
iy = Ue, ave ee (vii). 


taCs, 


Now as w is quite arbitrary put it equal to the class-mark of w or C,, then 


r eee 
"y 0, = Cys bees Selec sain ota oletainles stone (viii) 
yCy 
Substitute this in (iv) and we havet+ 
"CyC ; 
Toy = na ge BpsngehuGeNye conan NBd030ceddoo0e (ix). 
20, yy 


* The primary establishment of equation (v) is due to ‘‘Student.” His paper published in this 
number reached me, as I was writing this paper. I had obtained (iv) and (ix) and used them to correct 
contingency, but not to correct 7. 

+ A somewhat different proof of this formula may be obtained as follows: the partial correlation 
ay" C,Cy is clearly zero, for when x and y are constant C,, and C, do not vary... Hence 


. —2 —_” . =F a a 2 . . y = 
/} CG, ie! 7 vy) vr C,,) Cy "y C; 7 y Cyt ay (ee Cray Cyt xC, "y c,) = 


2 9° 99) n° Vi » 
ay "ey —"y C,["xC,="20,l"yo,> 


and if we substitute for 7 , and» we have 
yC, xy 


el C (1 — Pay) = “py = my) "y G ly C= eo, 


ay 
r Y 
= Cr Cy 


i , as before. 
se "yO," 2C, 


or 
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Thus 1/(7,¢,7,¢,) appears to be the corrective factor when we group the 
variates w and y both into broad categories, the ranges of which are of any 
nature. 


We have : _ S (np ®eYse)/N (x) 
Cy (pe Vs Ga %2)/N VS (mye) Reo iii irri) ) 
s (tae) 
and i SN (x1), 


Ms =) Ie ye 
Sea) *8(¥ o,}) 
where ny is the frequency of the cell in the sth category of # and tth category 
of y. 

It is obvious that r,, takes its usual value when the classes of « and y become 
very numerous, for both factors in the denominator are then unity. 


In the particular case of normal distributions, on the assumption that the 
product Z%¢ is sufficiently close to %,% to be replaced by it: 


S {me (25, = 25) (24, = ah 


Ns Nt 

Dg 
N Abe (ue ‘ 

S iz (25, — 2) S le (24, — at 


which admits of fairly ready determination from Sheppard’s Tables. 


(5) An approximate value of r,¢ can be reached in the following manner, if 
we suppose the range of the broad classes are all equal and given by /h. 
Assume a parabola y=at bet ca? 


to give by its area the three class frequencies ns, %s, %s41, the origin being at the 
start of the n,_, class. Its equation will be found to be 


Te 2N,1, — (nm, + 11n,_, _ Msi — BNs + 2M. (7) 4 fs — Wns + Ns—1 ("" (xiii), 


6h h h 2h h 
2h 
But NsEy_ =) yeas = bs (N44 + 86ns — Ns_4), 
h 24 
: Eo ee, bo Nga — Nea 
or are) Teme 


Thus, if x,’ be the distance from the origin to the middle of the sth class, 


= Dea 
Ej — By = — h-EH —,, 
24 Ns 


and if @, and 2, be measured from the mean of the whole population 


1) Msa a s+ 


a 4 Ns , 


Ly = Ls 
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Ng har, ss — N44 pet lee e Ns) 


Tie Ns FG Ns 


h h? h? 2 (Nga — Mo)? 
= No — — (Hs Ns_1 — Lea Nsyi) — zo MA — zo Mt + 5 
12 12 12 oe Ns 


Summing for all classes and dividing by No,” 


s (5 Be) = SO) by I? i een ms ? g (Ms 1— N44)? 
No?) Nog 1D Ge SP ae 5760, Ns : 
But by Sheppard’s Theorem, with contact at the tails of the order we are 
supposing : 
S (nsx 1 
Cy = zee ®) ~ 712 is 


Further, if we suppose 54, Ys, Yss1, Yst2 to be the bounding ordinates of the 
classes N41, Ns, Nsi Of the frequency curve, 


2 
As_) = Ns_y — Yysh =Ns_1— ce s+ we e Beall De 7 (ty py 5ns + Naa) 


Similarly Osa = Ns41 — Yoro = 4 (Mp4 — 5g + 4541). 
Here a,_, and a,,, are the excesses of n, and n; above rectangles ; and we have 


a7, 
Os) — s41 = ¥ (Mg_y — M41). 


Thus finally 


Ns Be\ _ 1 i h? (@; 10545)" : 
S ( ) => — 12 Ox + Tae — Nn, Cee racrcccee (xiv). 


Consider the last term; it may be written 


s (Gs — As_y + Oy — a) . 4 (a, — ace a = <2 W? 


Ns 


Ys-1 


< h >< h >< h —> 


Now the numerators are mean differences of what will be, if h be at all small, 
small areas, and these mean differences are divided in the one case by n; and the 
other by V, much larger quantities. Hence the last term is as a rule very much 
smaller than the second, and we have approximately 
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Thus we see how rapidly, if 4 be small with regard to oy, the value of the 
quantity 


and therefore of 7, C1 its square root, approaches unity. 


(6) It is desirable to illustrate the approach of Ty, to unity as we increase 
the number of groups. I have therefore worked out this approach for three and 
more symmetrical groups when (a) we use the approximate formula (xv), (b) we 
still suppose the classes to have their ranges equal but the frequencies to be given 
by a normal curve, (¢) we assume the frequencies in the classes and not the class 
ranges to be equal, the frequency being supposed normal, (d) we suppose the 
frequencies to increase by 50°/, at each stage, ie. to be as 1, 3/2, 9/4, 27/8, ete. 


These cases will be sufficient to indicate what sort of frequencies we should 
take for few classes in order to get the highest correlation between variate and 
its class-mark. I shall after these theoretical investigations consider a few actual 
cases of “broad” categories. 


r 2 ae Coy Vad , , i * 
Values of 1c, for Various Groupings *. 


ee a aul aves, Avy | ) Fiaual fees 
0. 0 ual Kanges, Any | a “qua nereasin 
Classes zy Frequency. i | N ck errr iel Subfrequencies, Subtequetaes 
Formula (xv) j Normal “Tedueney | Normal Frequency | Normal Frequency 
3 ‘817 (h=20) | 859 ‘891 876 
4 901 (h=1°5c) 915 928 912 
3) 938 (h=1°2c) 943 ‘947 928 
6 ‘957 (h=o) ‘960 959 “939 
8 978 (h="75c) ‘977 972 = 
10 ‘985 (h=‘6c) 985 ‘979 P — 
12 990 (h=‘5e) ‘989 ‘984 — 
Lh 992 (h=3c) 992 ‘987 — 
16 994 (h=20) 994 -990 = 
20 996 (h="3c) 996 992 = 


It is clear from the fourth column that nothing whatever is gained by 
exaggerating the frequencies of the extreme or tail sections. Further: 


(a) Equal frequencies are better than equal ranges up to about six classes. 
After six classes it is better to take equal ranges. 
(b) After six classes the approximate formula (xv) is amply sufficient in the 


case of equal ranges to obtain the value of r,¢_. 


Thus we have the general rule that up to six broad categories it is desirable to 
make the frequencies of those classes approximately equal, but beyond this greater 
* The number of classes given in (a) and (b) is based upon the range of 60 covering for such data as 


we usually deal with, i.e. 500 to 1000 cases, the total frequency. 
Biometrika 1x 16 
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exactness will be reached by equal ranges. The differences, however, between 
equal ranges and equal frequencies are after six classes so small—within the value 
of the usual probable error of random sampling—that either method will give 
practically quite good results. 


(7) It is interesting to note the relation of the present method to the usual 
correction for grouping in the value of the correlation. In that method we take 
the groups at their mid-points, and we do not correct the product, but only 
the standard deviations of the two variates by the usual Sheppard’s correction, 
jgth of the sub-range squared being subtracted from the raw second moment 
coefficient. Now we have 

ay =S (Ns ls Yt)/ Ca oy, 


and we have to replace a,, y, by @ and %. 


But we have seen that approximately with equal ranges 


24 Ng ; 
1,%4-7 

I= Ye— og aa =, 

Ms (Hs\") _ 1 he nN (= m Ie 
gue sty Glee 26,2’ ae = widens 
Now , 
S (ngs Yr) = S (nse Ls Yt) — is AS [Pema ets nay} 
24 Ns 
Le (ie ees 5 1 Oe Ns—1 — Ms41 Ney — Ney j 
= kS ae nc} +. B76 hkS {ne 7s Pee (xvi). 


Consider first the last term*, it contains not only the product of hk, but also the 
product of differences, and is of the form, when we divide by Nozo,, 


Y thie Nst a 4 (a,—a,_,) +4 (Gs44 = Os) ) 4 (a’, ne a=) ate 4 (a’t44 = ao} 
36 oz oy : 


N Ns Me 


which will be of the fourth order of small quantities if h/o, and k/o, be small and 
may be neglected as compared to the square of small quantities. 


* As a matter of fact for Gaussian frequency 


Ns_1—Ngy1 M4-1—-7N ae 

Pls stl 4-1 — 41  (, 

S (r% rae A ) =4hKS (Ny X,Y) 
Ns t 


to our degree of approximation. Thus the fourth term may be written = hk? S (nt, y;), Which gives 


oe Wop he tb ne 
S (254 Ts Hz) =S (gp Ve Y) ( PD i? Dp ke + 44 ae) 


teat rege 
=S (Ny, Y2) (1 - 19 nm) (1 - 19 ro) : 
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Next, considering the second summation and summing first ny; for all values 
of t, it equals ns¥/s, where 7, is mean of all y’s in array corresponding to a;. But 


ro 
“x. Thus the second sum equals 


if the regression be linear 7; = 
. 


i h 
94,7 Ty% G2 = S {(Ms1 — Ns41) Xs} 


94 CyFe — 318 (@yNs—1 — B41 Msp) + AS (ns) + AS (M641)} 


as before. But in a term of this order we may put 
Nroyon=S (ses Yt): 
Similarly the third summation 


Ly =: ; 
= 12 S (15422 


i) 


z ke Pe 
Or, finally, (sts Ye) = (Net le Ye) (1 - S52 dat 5) eee (xvi) bis. 
y 
nst Bs dL & O, h ke 
Th 8 G Gady cee Tait D Gig ~ 126,? ) 
ss 2) ( HE) aa TA : 
No? Noi Phony 
or, since we neglect terms of fourth order, 
gf st 
SHE) 8 (Gre) : 
ey, a ye ode vetee nie ues (xvi), } 


Tie NE Yt Ox Fy 
: F 2) (rea) 


the usual value, o, and o, having of course to be corrected by Sheppard. 


We see by (xv) that 


ot Ge z 5 (1 a = on ee schaneea nein (xvii) bis, 
Ox 
but by Sheppard 
oe Y NE 1 
ot=8 (500) (1 ipsa): 
Accordingly 
8 (a) =8 (Foe) (1— Fes) cence (xviii), 


which enables us with equal small subranges to use the standard deviation of 
means or mid-abscissae at our pleasure. 


(8) Of course it is absurd in practice to push our results to the extreme* of 
two categories only, but theoretically it is not without interest to note the results 
which flow from such an assumption. 


* We have seen that 7,7, can only be replaced by %,%, in (xi), or as a special case, (xii) used, 
provided the subranges are equal and fairly small. 
16—2 


124 Influence of “Broad Categories” on Correlation 


Let the division be into two categories containing V x (14a) and V x (1 -a) 
individuals, and let no assumption be made as to the nature of the frequency. 


Then 
8 & 25) =$(1+a)z?+$(1 —a) ay, 


N 
but 4(1+a)%,+4(1 —a)z,=0, 
, : Ns =» eo Clee) 
and therefore S (3 rn?) = %, ease 
pak Xo t(l—-a F 
Thus 20, = = fe (classi. one sa eee (xix). 


We note therefore that unless we suppose all the frequency in each category 
concentrated at one point—a very rare occurrence—r,q_ will always be less than 


unity, and therefore in correlating class indices a correction will always have to 
be made. We will consider two cases: (i) when the frequency distribution is a 
rectangle of length l. In this case #, is =/ {4(1 + @)} and o, =1/)/3, we have then 


TC, =V3 x$(1 +a) x $(1 —a) sig skis ald see cea (xx 


(ii) the distribution of frequency is supposed to be Gaussian. In this case 


ga 0X On 
2G=o) 
and ; '.¢.= == 4a 

° /}(l—a) xf $a) 
in other words, Eh is the reciprocal of the x, already tabled in this number of 


Biometrika, p. 27. 


We have the following results : 


Value of 7.0 | 
x 
Value of 4 (1+a) ° Pk Ae el 
as percentage of | 
total Gaussian | Rectangular 
Frequency Frequency | 
50 °/, “798 866 
60 °/, 7789") ||| | 849 = 7 
Ole s “759 |: 79a 
80 °/, 10019) |) 693 
90 °/, "585 "520 
95 °/, 473 | 377 
She) y 268 172 


It will be seen at once how rapidly the corrective factor 1/r,¢, rises as the 


division of the categories becomes more and more unequal. But in both cases 


very sensible correction is needful and its nature depends on the particular 
frequency. : 
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Let us now pass to the consideration of the correlation 7, as obtained by 
correction from the class index correlation, 1e. 


pike UGGS 
Cy ae ae 
Vy Ge Ko 


a yy 
The difficulty is to know how to determine the product S(ng% Ys) for such a 
slender division as a fourfold table. If we assume it still to be equal to 


Te peo 
S (Nse Vs Ut) 
we can then find ry. 
We have S (st Bs Ye) = MAY, + NyLoYo + NzL3Yg + Mss, 

where we take for our fourfold table: 

Ny | Ny Ny+tNy 

N4 Ng ns + Ng 

ny +n Ne +N3 N 
But clearly on the above assumption 
=H, B=X3, Y=Yo, Ys = Y4- 

Thus S (Nst@s Yt) = NAY + NoLoYy + N3LoYy t+ N47 Ys, 
then since (ny + 4) ® + (My + Nz) L, = 0, 


és! (n, + Ng) NY + (n; + 4) Ya => 0, 
we easily deduce 
—_ Ho Y4 (Ny Nz — 23M) a 
ASCO a a ree 
( st 7s Yt) (n ae i) (n, it No) -(XX11), 


Similarly we find 


2 — etm, 
"mtn, 


creases Some nahecea Goes (xxi11). 
r mtn) 
co = ‘ 
Yt n+, ds 
And accordingly 
a 
bey iy ae Dnt Ge Ut) 
Cz Cy T7975 
NN — NyN : 
ea = eee eee (xxlv) 
NV (My + Neg) (My + 4) (My + Ng) (Ms + Ng) 


This is not really the true value of the class index correlation unless we assume 
LsYt=FeYs, which is not generally true. It is really the correlation of the means 
used as class-marks, and as shown long ago by me it is the correlation of errors in 
the means of the two variates, and also equals ¢@ the basis of the mean square 
contingency. If we correct to obtain r,, by the factors 


TO, — Om, | Tx and TG oy,/Fy, 
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we find 
Ny Nz — NpNq 
V apy SS aa one (xxv). 


= ea 
(n, + 13) (my + ny) X = x a 
Cis ey 


Mr Yule has termed (xxiv) the “theoretical value of. the correlation*.” 
It will be quite obvious from the present discussion that it is only an approximate 
value even to the class index correlation, and cannot represent rz, at all, unless 
the frequency distributions each consist of two isolated points. Generally 


= Ny Nz — Nz . Tx Tx Ty Ty 
V (1, + 29) (2, + Ny) (Ng + 03) (My + 14) NV & &, “Ys 


Tay 
ae 
ae Oy Tx Dy Gy 
= OIG RA eae 
# ® Ly Yr Ys 
This corrective value on rg_g, is always true, although the above value 
x 

of r¢ ¢, 18 only approximate. 
This is brought out at once by considering what (xxv) reduces to in the case 

of Gaussian frequency. Here 


1 1 h? 
= = weoOura2 
(1, +n;) = N2.0,— 4 o EE 6. ime 
7 
1 k2 
Te age 


. y,— N = N i == ; 
(m3 +5) Yy ZyOy Cy ox 
where h and k are the distances of the dividing lines from the mean. Hence if 
H and K have their usual meanings (xxv) gives us 


Ny Nz — NyNq 


Tay — Vanice ~ ° 3 ielajeig ie ciclete ors lefetnere eis toleretaraeteye 


e is the expression used in my memoir on the correlation of characters not 
quantitatively measurablet. Thus the physical meaning of ¢ is rg ¢, corrected 


(xxvii). 


for the use of class indices, but not for the assumption that %» x Ys may be 
replaced by %,x%. We may therefore anticipate that e¢ or, failing Gaussian 
frequency, (xxv), will give a better approximation than ro,c, to the true value of 
the correlation of « and y. 


I shall discuss elsewhere the possibility of any absolute identity in the 
members of a class; the only thing in which I have personally come across it 
is in theoretical investigations—never in practical—on Mendelian units. When 
we are absolutely ignorant of the nature of the frequency, then I feel sure that 
in dealing with either physical or mental characteristics in living forms, the 
Gaussian distribution will in the long run be found closer than any single type 
of distribution hitherto discussed, and that accordingly even e—to say nothing 


* Introduction to the Theory of Statistics, p. 212. 
+ Phil. Trans. Vol. 195, A, p. 7. 
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of r by fourfold—would be far safer to use than r¢ ¢, or any arbitrary coefficient 
of association, which wholly neglects the question of the type of the frequency 
distribution. In the rare cases in which the frequency is collected into points— 
absolute homogeneity of category contents—then ¢,¢, 8 the right coefficient 
to use and not Yule’s coefficient of association. 


(9) I propose in this section to consider the relations between 


8 (FF. Let » Tt) S(y Xs. 7) and s(%. Lei ys) 


when the subranges for each character are small and equal. Let those for # be h, 
and for y be k. Then consider the surface 
zZ=a+t ba+cy+ da? + ey’, 
and choose a, b, c, d, e so that it gives by its volumes correctly the five class 
frequencies 
Ms-3,ty Ms ty Mstit, Ms,t-1y Ms, t41- 
Then if we take the origin so that it lies at the mid-point of the n,, group, 
we find 
il ‘ il 
ia Ms, t— 79 Enea t +s, 11 — 2%s, 1) — 4 (Ms41, t— M1, t+ Ms, t41 — Ns, t-1) 


1 M%i3,4—M%a,t , 1 ¥ Ns, tri — Ms, ta 


em = 9 oi ie 
Further 
Nst (sp — Ls) = ie sigs Bee anda = = bh — a (He nF ee ) ; 
Or Fig = He + 55 sy iat ee ULL Oe ee ee (xxviii). 
Similarly Tu = ye + gy beh Sere ee eee Cay 


Now take the product of these, multiply by n and sum, we have 


S (oF Za) = S (FF aay) + 5 ag bS | h (Ms, Clg —1, of 
TAY (er) 
+ 3g FS LT Oat | 


Ns t+1— Ns t+ Ns. t — Ns. F— n n Net 
+ gg eS ( 8, t+1 st st tts, t ) v, ( s+1,t — Ms, (ar 's,t — Ns, 4), 


2N Qt 
But if h and k& be small the last summation is, precisely as that dealt with on 
p. 122, of the fourth order, and we have ; 


S (5 Za You) =) Ga sy) Tide sieay cawoeshesutids (xxx), 
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for the second and third sums obviously vanish if we suppose zero frequencies 
at the boundaries of each row or column. Hence by (xvi) bis we have 


aitecas I? ke ; 
S iF Zia) = S (nes Ut) x (2 + 126, Pie Relea (xxx1). 


Thus we cannot replace S(nytYu) by S(my4%,y,) unless h and k are very small 
relative to o, and oy, respectively. 


: S (Nay 
But now consider 7, = MED the accurate value. 
ox Oy 


We may replace S(nz,ay) by S(nya,y,), there being no Sheppard’s correction 
for the product moment. Hence 


Ng, Mie oe 
S (Fay) 8 (Fe zTe) 
Vy = = Dy (xxx) 


Gy Ty On Cy 


— gi {Mat Be Ye he ni) 
io Sy Cx eA (1 a 1263 SE 12,7 by (xxx) 

Ne Bs Ve 

EF Sie) 
= (= Sy an 
(1 Gea) (2 ae) 

8 (Fe Wg ue 
AN ae oy) Dy (2 VAL) P0t8 es eae nee eee (xxxi1). 
8 (7 =) x 8 (Fy ut) 
NV G3 N oF 


This formula is free from all Sheppard’s corrections, and in form it does not 
involve the equality of the subranges. Hence it seems likely to give moderately 
good results even for unequal subranges, provided they do not differ very widely, 
and there are not too few of them. It leads as before in the case of normal 


g {ra Ce 


n,n (2 31 =2,.) (24, = “yt 


Tey = i PE a Gas a ) 


but it is clear that this cannot be pressed so far as to make only a very few, and 
those very unequal, divisions for each variate. 


to the same degree of approximation 


frequency to 


(10) The present method of correction seems likely to give good results in 
the case of the method of mean square contingency*, when we may reasonably 
assume the distribution not widely divergent from the Gaussian. 


* A much fuller discussion of all the corrections for contingency has been some years in progress 
and will shortly appear. 
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Illustration I. Thus I take the table for correlation of stature in father and 
son, the subranges being each two inches. 


Stature of Father. 


| 
| 
8 | 595-615 | - 15 1 | 35 | 
Qe 6l5 63-5 | 0 | 2°75 | 5:75 | 9 | 5 | 025) O85) — | — 24 | 
6 | 63:5—65:5 | 4 7°75 | 20 41°5 | 17-25) 8:25) 1:25) —" | — 100 
o | 65-675 | 2 10 | 32 73 TBO oes) lereo | al = 237° 
5s | 675—69'5 | — | 4:5 | 27-75] 65:5 | 95 93°25|31°5 | 4:5 | 1 323 
2 | 695—715 | 6°75 | 38°25 | 61 77°5 |39°5 | 11 2 236 
wm | 715—73'5 0:25] 5°75 |24°75|34:5 | 32-25] 7 0-5 105 
13:8—15'5 1 3 6:25| 6°75 | 18 55 | 2 375 | 
155175 eee belie: | 2"5 bi) 8 | 
TY 5 —19'5 | 2 | 0.) a = 3 | 
i 
Totals 237°5 | 291°5 257°5 | 127 2:3 Po) P1078 | 


Worked out by the product-moment method and using the Sheppard correction 
for the’s.D.'s, 7, = "5259. 

I then proceed to make it into two further tables (i) a 5 x 5, and (i1) a3 x3 
celled table. These are as follows: 


Stature of Father. 


= HS 2 i | 
= ion) ~ } ite} 
Ne) Ne) Ss ~~ ~~ 
a l l Totals 
a re glee = 2 = 
2B Se | 
» | 075-655 | 42-25 | 52 | 23-25 85 | 15 127°5 
H | 55675 | 44 73 78°75 | 335 | 8:25 237°5 
B | 675-695 | 32°25 | 65:5 95 93°25 | 37 323 
8 | 695—71-5 6°75 | 38:25 | 61 77° 52°5 236 
MQ | 71:5—79-5 1:25 8°75 33°5 44°75 65°75 154 
Totals 1265 |23y-5" 291-5. |267-5. | 165 1078 
and 
Stature of Father. 
d 58:5—66°5 | 66-5—68'5 | 685—765 | Totals | 
M Se | : 
oe 59:5—67-5 211°25 102 51°75 365 
S 67:5 —69°5 97°75 95 | 130°25 323 
g 69°5—79-5 5D 94°5 240°5 390 | 
~ SS 
ayy 
eb Totals 364 291°5 422°5 1078 


In obtaining these tables I was generally guided by an endeavour to roughly 
equalise the frequencies in the totals. 
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If x be the stature of father and y of son, then for the 5 x 5-fold table we have, 
reading from left to right and from top to bottom : 


&,=—1-6780, —‘7650, —-0694, +°6191, +1°5440, 
y,=— 16741, —-7617, —-0298, +6811, +1:5795. 


These give 


8 (Fo) = 917,091, 8 (yy Ur) = 86,6599, 
x y 
Ve Cis ‘9576, ‘y Cia 9293. 


I now calculated ¢? and found for raw V¢? the value 314°7514; deducting 
the correction for 5 x 5 cells, this equals 298°7514, whence ¢?=:277,1349, 


tery ’ 
Teepe 200° 


Taking this to represent the correlation of C,, and C, we should have 
Pay = °4659/(9576 x 9293) 
= "5235, 
which is excellently in keeping with the value found from the 9 x 10 table by 
product moment, and not very different from the value 5140 found from the 
original 17 x 20 table in inches by the same process. 

I now turn to the 3 x 3-fold table and find the raw WN¢? = 219°2194 
or less the correction for 3 x 3 cells = 215:2194, whence ¢?='199,6469. Thus 
C =-40795. 

For this table reading in same directions as before : 

#,=—1-:0823, —-0694, + -9803, 
Y= — 110804, —:0298, +1°0358, 


whence 8 (Ff @r\= 773,470, - 8 & Un) =-783,6384, 
N = Nan? 
yo, = 8795, ry ¢, = 8852. 
Thus taking as before Tay = C/{(T20,7y Cy)» 
we find Yay = 5240. 


Again in excellent agreement with the 5x5 contingency result and also the 
g sg s 

product-moment result. The evaluations of Z, and % are of course based on 
the assumption of a Gaussian distribution for those variates. 


Illustration II. The following table is given by Mr W. H. Gilby in a paper 
in Biometrika (Vol. vu, p. 106) on the “'Teacher’s Appreciation of General 
Intelligence.” It correlates class of intelligence with character of the clothing 
in boys. By treating the grades of intelligence and clothing as Gaussian variates 
an almost linear regression line was obtained in that paper. Hence it seems a 
favourable case to examine the present corrections upon. Intelligence was 
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measured on my scale (see Biometrika, Vol. vit, p. 93) and clothing was divided 
into five categories, IV and V being ultimately classed together. The better 


intelligence has the higher letter, the worse clothing the higher number. The 
table runs: 


Intelligence. 
B C D E F G Totals 
on i 33 48 | 113 | 209 194 39 636 
= II 41 | 100 | 202 | 255 | 138 15 751 
= III 39 | 58 | 70 | 61 33 4 265 
eS IVandV]{ 17 13 22 10 10 1 73 
Totals | 130 | 219 | 407 | 535 | 375 59 1725 


I shall use a for Intelligence and y for Clothing. Mr Gilby found raw 
¢? ='1014, corrected for 4x 6 cells ¢?= "0927 and C=-291. I find from left to 


right : 
4 @,=— 18859, —1:1009, — ‘4758, + ‘2428, +1:1179, 42-2167, 
Yy.=— 10066, + 2173, +1:2130, +2°1317. 
Hence 
Ms Hs) \ _, 3 895 LOG ee us 
8 (7 zs) 933,393, s ( i a) 812,475, 
rec, = 9661, yo, = 9014. 


If we now correct Mr Gilby’s contingency for the class-index correlations of 
x and y we have 
Tay = Cl(Ty C,7y c,) = '3342. 


This is very near the values, 343 and 340, found by converting the table into a 
three-rowed table, classes III, IV and V of clothing being grouped together and 
a bi-serial » method used (Biometrika, Vol. vin. p. 98). 


I then proceeded to work out on the full table given above the value of r 
as determined by the approximate formula (xvii). 


This can be conveniently 
arranged thus: 


| —1°8859 — 1°1009 — 4758 + °2428 +1:1179 +2°2167 
— 1:0066 33 48 113 209 194 39 
+ ‘2173 41 | 100 202 255 138 15 
+1:2130 39 58 70 61 33 4 
+2°1317 il 13 22 10 10 1 

| +33°2178 | +48°3168 | +113°7458 | —210°3794 | —195:2804 | — 39-2574 

— 8:9093 | —21°7300 | — 43°8946 | + 55°4115 | + 29°9874 | 4 3:2595 

—47°3070 | —70°3540 | — 84°9100 | + 73°9930 | + 40°0290 | + 4°8520 

— 36°2389 | -27°7121 | — 46°8974 | + 21°3170 | + 21°3170 | 4 2°1317 

= | = at ———— 
— 59°2374 | —71°4793 | — 61°9562 | — 59°6579 | —103°9470 | —29:0142 


17—2 
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Therefore S & wit) = —'240,51548, 
Nova, 


24051548 
"CeCy =~ 9661 x 9014 7! 
and r¢,¢,/(x 6, yO, = — B1T1*. 

In the above table each row is first multiplied by the y on the left and the 
final sign at once given to the product; this forms the lower half of the table. 
The columns are then added up with the results given at the foot. The column 
sums are then multiplied by their respective @,’s, shown at the top of the first 
half of the table, regardless of the sign of %,, and the sum divided by 1725 as a 
continuous operation on the calculator, which shows finally *240,51548. 


The value ultimately obtained, ‘32, is somewhat less than the 33 of the con- 
tingency result but of the same order for all practical purposes. 


I next reduce my table to a 4 x 4 table as follows: 


Intelligence. 
B+C | D | # | F+@ 

ch | I 113 | 209 | 233 

2 I 202 | 255 153 

= UI 70 | 61 37 

5 | IV and V 22, 10 11 

Totals 349 | 407 | 535 | 434 
We have #,=— 13934, —'4758, + 2428, 41-2674, 
Y:=—1:0066, 4+°2178, +4+1:2130, +4+2:1817, 

n.72 A],2 \ 

g Ng ks )= 868 (Fe me 

( Agi) = 868,606, S (spo) = 812475, 
Ve (Che = 9320, Ty C= 9014. 
Proceeding as before by formula (xvii) we find 
— '228,76485 


Tau = 368,606 x 812,475 ~~ Ot 


a result which is again in excellent agreement with the previous ones. 


Lastly I convert my table into a 3 x 3 table: 


Intelligence. 
| B+C+D| E | Fea | Totals 
i a 
a | 194 | 209 | 238 636 
s | 343 255 153 751 
4 219 71 48 338 
iS) | | 


756 | 535 434 1725 


* The negative sign shows that worse intelligence is associated with the poorer clothing. 
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I found H=— *8995, +°2428, +1:2674, 
j= —1:0066, +2173, +1-4110, 


Ns \ és Mee ae 
and s( We) = "776,941, s( on = "784,241, 
Vx C= 8814, Ty GG 8856. 


Proceeding exactly as before by formula (xvil) we have 


— '204,01275 


"oy = 784 4I x 776,941 = OO 


For comparison I worked out the above 3x3 table by contingency. The 
contingency corrected for 3 x 3 cells is ¢?='076,0779, and therefore C =:2659, 
hence 
'2659/(‘S814 x °8856) 


= 3406, regardless of sign. 


Vay 


To sum up our results for this case we have : 


| 
| Bi-Serial 7 Contingency | Formula (xvii) 


| | | 


6x2 Table 340 and -343 | = | = 
6x4 Table et (sem 
4x4 Table | ah 


| | | Corrected 
| 
| 


| 3x3 Table 


The results are very satisfactory and show that for correlations of this 
magnitude our corrective factors work excellently when we use contingency to 
find rg g,- Formula (xvii) also gives fairly consistent results, but this is partially 
due to the relative smallness of r. For the smaller 7 the more nearly 7% x 7, 
may be replaced by %, x 7 as is well shown in the case of the simple 2 x 2 table. 


On my scale of Intelligence seven categories are used ; there were in Mr Gilby’s 
schools none of the A or Mentally Defective class, so that he only used six, B—G. 
We see that there is a high degree of correlation between the actual intelligence 
of a child and its class index. We must have 


For seven classes T> 90, 
yo Six ie 7— Sis 
, four kd ioe 
, three rs r='88. 


It is of interest to put beside these the frequencies of my General Health 
Scale and the resulting class index correlations. The scale had six classes: 


y 
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Very Robust, Robust, Normally Healthy, Rather Delicate, Delicate and Very 
Delicate. Applied to 2037 schoolgirls (Biometrika, Vol. 111. p. 166) it gave: 


| V.R. R. N. H. R. D.+D. V.D. Total 
| | 
| 109 578°5 | 803°5 478°5 67°5 2037 
& 
We have: = | — 20342} —:9033 | +:0936 +1-0837 +2°2293 
x 
; ie Go 
leading to s(F 4) =897,1616, and rpg = 9472. 
. Neo Pea 
If we divide into four classes, thus: 
Vv. RB. R. N.H. R.D.+D.+V.D.| Total 
109 578°5 803°5 546 2037 
We have: =| —2°0342 — +9033 +0936 41:2253 
Ox 
: Mewes 
leading to 8 (7 *) ='858,9714, and ry¢ = "9268. 
‘ 


And lastly if we divide into three classes, thus: 


I WW TR TR, N. H. R. D.+D.4+V. D. Total 
| 
3 
| 687°5 803-5 | 546 2037 
Wehnve® 2-1) 1160826 +0936 +1°2253 
Or | 
xX | | 


s GS) = °801,3866, and yo, ='8952. 


leading to Voz 


Now if we put these results with those on p. 121 together we find: 


Number 
of 
Groups 


D Dis Og 


Gaussian Frequency Pearson’s Scale PonrconieiSonle 
EES penenan General Health 
Subfrequencies Intelligence 
891 *881 *895 
928 932 927 
947 — 947 
"959 ‘966 _ 


It will be noticed that the frequencies differ considerably, although not 
extremely, from equality. Accordingly we may find it sufficient in many cases, 
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although the frequencies are not symmetrical or equal, to state the value of 7, ¢, 


for the same number of groups with equal frequencies from the table on p. 121, 
and save the trouble of calculating rz ¢,. 


Illustration IIT. I took the 3 x3 table for correlation of health in pairs of 
sisters, 1.e. 
First Sister. 


N.H. |BR.D.+D.4+V.D.| Totals 


V. R.+R. 


| 

| 
a | 
5 ———E———_ —————— | 
@ | 
Dn V.R.+R. 428 172 687°5 | 
a IN EI ee 172 411 220°5 803°5 | 
q R. D.+D.+ V. D 87°5 220°5 5460 
5 | 
o — 
Z | 

2037 


and 1 determined the contingency. The raw value of N¢? = 4249285, and 
after correction for number of cells = 4209285. Hence 


d¢? = '206,6414, 
C= 4188. 
Therefore by the value just found for 7, ¢, for three health classes 
lay = eee 
801,3866 
I then worked out r,, by aid of the approximate formula 


N oy cy 


Poy = fe (2) tom (2) | 


which gives in this case Try = °4940. 


= 5164, 


The value as given in my Huxley Lecture* for the mean of two fourfold 
tables with divisions first between Robust and Healthy and then between Delicate 
and Healthy was 51. The agreement between the three methods appears 
reasonably good. I compared the last value with the same formula applied to 
the original 5 x 5 table (see Biometrika, Vol. UL, p. 166). I found 

( Nst UsYt “ 57 8 ik 
S G a = 392,7573*, 
and Try = ‘4880. 


It will be clear that for the type of argument which is generally based on such 
numbers the accordance is very satisfactory. 


“In the case of the 3x3 table this product was °317,2398, which measures the amount of 
correction made by denominator, 
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Illustration IV. I take now an extreme case, namely the Table: 


Totals 
| Oo 5793 
ooh eas 3399 
bs 808 
Totals 10,000 

| ; 


I call this an extreme case because the table is only 3 x 3, and the frequencies 
and the subranges are very unequal. 
We have &/o,=—'6172, +°7011, 4 1°6871, 
Yt/oy = — "6750, +°7100, 4+1°8531. 


] so D QI pet) = 6 4.9, ” — *Q95 
Whence I find 8 {5p ,) = 694,222, i = EY) 
Y: YE . y dt a Q 6 
sl ue) 712,753, Tyo, = 8442. 
I then worked out the mean square contingency and found for crude numbers 
No? = 51466. 
Corrected for number of cells gp? = 51426. 
Whence C = °5827. 
Therefore Nay = 5827 /((8332 x 8442) = 8284. 


The actual table has been constructed in rownd numbers as the distribution 
of a Gaussian frequency surface for r= ‘80, the divisions being at x/o,=+°3 
and + 1:2 and y/o,=+°2 and +1°4, and the correlation is hardly likely to differ 
by a unit from ‘80. Considering the marked inequality of the subfrequencies 
the corrected contingency approaches closely to the correlation—at least the 
approach is sufficient for any argument likely to be drawn from the data in a 
3 x 3-fold table. 

Illustration V. It is profitable to show the amount of error which will be 
introduced by taking a marked case of non-Gaussian frequency. When looking 
out for such cases many years ago, I found one of the most representative 
instances in the case of barometric heights. I take the following table from 
the memoir by Dr Lee and myself*, and I have arranged the 3 x 3-fold table 
so as to give very unequal frequencies. We have thus in this case non-Gaussian 
frequency and very unequal frequencies. We find 

#,=—1:1479, —-1581, +°9161, 
J,=—1:2797, —-2940, +-8424, 


* Phil. Trans. Vol. 190, A, p. 453. 
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Southampton. 


Nn 
| 
= 

=] 

2. 
oO | 
5 

w 
Ne) 
nr 
ae 

o 

ca 
a | 


Over 30°15 | 30°15—29°9! 


| Over 30:15 
30°15 —29°85 
Under 29°85 


Totals 


We have raw V¢? = 1449°52. 
¢? corrected for number of cells = "494,7023, O = °5753. 
es ‘5753 

¥ +8823 x ‘8736 
Found by the product-moment method* 
Without Sheppard : r,, =°7752 + 0050, with Sheppard: r,,=°7802 + ‘0049. 


Thus: = '7464. 


Thus the difference is more than four times the probable error, but is not of a 
nature upon which any sweeping conclusions would be drawn. Indeed some 
might prefer the value drawn from the corrected 3 x 3-fold table, owing to the 
distrust of isolated outlying observations which often widely modify the constants 
of a distribution calculated by product-moment methods. 


To still further test the pliability of the method, I rearranged the data in 
a 3x 8-fold table with markedly unequal frequencies but nearly equal ranges. 
thus: 


Southampton. 
31°05—80°15 | 80°15—29°25 | 29:25-—28:85 Totals 
“a | 80°85—29°35 ... 808-25 | 33°21 1541°5 
S| 29°85 28:85 ... 83°75 223-2: 1352°5 
Ries so e780... 0 , | 28 
4 fo a 
Totals 2922 


A more unpromising series of totals is hardly likely to be met with! We 


have 
%,=—11479, +°4467, + 24133, 
8 Ga) = 635,448, reo, = T9T2. 
ij, =— "7544, +°8044, 42-6797, 
s Ge) = +568,5512, yc, = 7540. 


Thus we reach some of the lowest values we have come across for 7,¢, and 
yc, 1 the case of threefold divisions. 


* In Phil. Trans. Vol. 190 A, p. 453, the value -7572 was printed in error for +7752. 
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Raw WV¢? = 766'887, corrected ¢? = '761,0838 and C= -4550, 
Pay = °4550/(7972 x °7540) = °7570, 
a result differing only by ‘02 from that of the product-moment method. 
It will be noticed that I have arranged the material so that the subranges 


are equal. If we work the 3 x 3-fold table out, using Sheppard’s correction, we 


find by product-moment 
= ‘7746, 


which is considerably nearer the mark than the corrected contingency value. 


In this case a poor result is given by the formula: 
Nee ©; Yi Mga, fede’ 
Sie ae nee) a 

(i Cx a No, (We aye 
the assumption that S (ny % Hu) = S (Net %Y,) being by no means satisfactory for 
(1) very non-Gaussian distributions, or (11) high correlations, when the subranges 
if equal are very large. For equal ranges with a moderate approach to the 
Gaussian distribution and a correlation not much exceeding 0°5, it gives quite 


good results. Thus if we take the table for stature in Father and Son and 
arrange it in practically equal subranges thus: 


Statur e no Father. 


q | 58: 5—OL5 C4L5—T0'5 705 —T6'5 Totals 
NM —se eer 
= Under 67°5 ... 86°25 | 269 | 9°75 365 
o | 67:5—73'5 39°25 | 495°5 129-25 664 
| 785—79°5 1 22 26 49 
= Eee 
= | Totals... 126° | 786-5 165 1078 
we find: %,=—16780, —:0540, +1:5439, 
ie =—1:0804, +°4387, + 2°1022, 
s & ws ‘) = 697,738, ro, = 8353, 
Ox = 
Ss (545) = 714,644, ry c, = 8454, 
ay : 


st © Us Yt Zor 
s(5 ne a) = +260,7392, 


= 5229, 
against 5259 by the product-moment methods. 
The same table yields by 3 x 3-fold contingency 
N¢? = 163:909, corrected ¢? = ‘148,3386, C= °3594, 
whence 7gy = ‘5089. 
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Thus the corrected contingency for roughly equal frequencies and equal sub- 
ranges gives respectively °52 and ‘51, but the formula (xvii) gives for equal ranges 
‘52 and for roughly equal frequencies 57. It seems to me therefore that corrected 
contingency can be more safely applied than formula (xvii). 


Conclusions. 


A further discussion of the corrections needful when using the method of 
mean square contingency will shortly be published, but the present paper seems 
to indicate that it can in the bulk of cases for 3 x 3-fold, 4 x 3-fold or 4 x 4-fold 
tables be used effectively. I am not aware that any other effective method has 
been proposed for such tables. The assumption of the Gaussian distribution to 
determine the correlation of the variate with its class index need not be made, 
if the material thrown into broad classes has had a sample quantitatively 
determined. But we have seen in this paper that the Gaussian assumption to 
fix the means of the broad classes is in many cases amply sufficient to give good 
results even in non-Gaussian frequencies. Of course a control series to determine 
Tz¢, Where practicable, may be advantageously sought*, but until some better 
method can be suggested, the present seems to me the best available for dealing 
with the correlation of variates classed in a few broad categories. 


With contingency tables of 5 x 5 or 6 x 6, the correcting factors will generally 
be small, but corrections must certainly be made for 4 x 4 and 3 x 3 tables. 


* For example, I took Southampton frequency, and calculated from the original table the means of 
the three groups 892, 1970°5, 59°5, corresponding to the table on p. 137; they were —1:1006, + °4123, 
and +2°1746, leading to TO ='8056, as against the Gaussian value ‘7972. This is for a sensibly 

© x 


non-Gaussian frequency, and a much worse agreement is permissible in actual practice, where the 
nature of the argument rarely turns on correlation differences of less than -05, 
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THE recent paper by Mr Yule entitled “On the Methods of Measuring Associa- 


tion between Two Attributes *” calls for an early reply on two grounds,—first 
because its singularly acrimonious tone is to us wholly inexplicable, not to say 
unusual, and secondly because we believe that, if Mr Yule’s views are accepted, 
irreparable damage will be done to the growth of modern statistical theory. 


* Journal of the Royal Statistical Society, Vol. Lxxy, 652. London, 1912, 
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Mr Yule has invented a series of statistical methods which are in no case based 
on a reasoned theory, but which possess the dangerous fascination of very easy and 
ready application, and therefore are at once seized upon as applicable to all sorts 
of problems by those who are without adequate training in statistical theory, 
or without the mathematical knowledge requisite to weigh cautiously their logical 
basis. The methods to which we refer are these : 


? 


(i) the use of the so-called “coefficient of association” to measure the 


relationship of two attributes ; 


(ii) the use of a new coefficient, which Mr Yule terms a “coefficient of 
colligation,” apparently to be used in like cases with the coefficient of association ; 


(iii) the development of a method which first appeared in a paper by the late 
Mr John Gray *; in this method each group of a contingency table is considered 
as a cell of wnit subrange for both variates. This assumption being made, Mr Yule 
calculates the coefficient of correlation by the product-moment method, and on 
the basis of this procedure terms his coefficient the coefficient of correlation and 
uses the customary letter 7 for it. 


Such a terminology is absolutely unjustifiable and can only confuse the 
uninstructed and undiscerning reader. If the groups were extended from 5 or 8 
to an indefinite number, all Mr Yule would reach by this method would be a 
correlation of ranks, not of variates. As it is, he has obtained a correlation of 
ranks with enormous “ brackets.” It does not seem to have occurred to him that 
the correlation of ranks may be quite different from the true correlation of variates, 
and that in cases where we do know the relationship the correlation of ranks 
is sensibly lower than that of variates. Further, he makes no suggestion that a 
very fundamental correction—that of the variate and class-index correlation—is 
needful before this method could possibly be applied to deduce a limit to the true 
correlation of variates. For these reasons we shall term Mr Yule’s latest method 
of approaching the problem of relationship of attributes the method of pseudo- 
ranks. We are concerned principally therefore in this paper with the so-called 
method of association and the method of pseudo-ranks. In addition we deal 
incidentally with other coefficients and reply to certain criticisms, not to say 
charges, Mr Yule has made against the work of one or both of us. 


(2) History of Subject. 


In view of the misstatements made in the discussion at the Royal Statistical 
Society, with regard to the history of the subject, a few preliminary remarks of an 
historical nature may be fitly made here. Mr Sanger, for example, said that “all 
statisticians before Mr Yule had this passion for the normal curve.” This statement 


* Gray and Tocher, Journal of Anthrop. Institute, 1900, Vol. xxx. p. 111. 

+ On the difficulty of ‘‘ brackets” in the correlation of ranks, see Pearson, ‘‘On Further Methods 
of determining Correlation,” Drapers’ Research Memoirs, Dulau and Co., p. 36. 

+ Journal Roy, Stat. Soc. Vol. uxxv. p. 645. 
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is not only unfair to Perozzo, but to one of us, whom Mr Yule was directly attacking. 
Pearson’s memoir on “Skew Variation in Homogeneous Material” was sent to 
the Royal Society on December 19, 1894. Mr Yule’s memoir on Association 
was presented on October 20, 1899. He had previously attended the statistical 
lectures of Pearson and been an assistant to him during a period when nearly 
the whole work of the statistical laboratory turned on non-Gaussian distributions. 
A collection was then made of non-Gaussian material with a view to dealing with 
the correlation surfaces of continuous non-Gaussian variates. Among the material 
especially selected as providing extreme cases were (i) barometric heights (memoir 
presented 1897) and (11) ages of husband and wife; of the latter the laboratory 
still possesses the contour lines drawn by Mr Yule under Pearson’s direction to 
indicate extreme cases of what the latter has termed skew variation, or wide 
deviation trom the normal surface. Another illustration of marked skewness is 
that of the contour lines of the correlation between the numbers of a particular 
suit in two partners’ hands at whist. These latter curves were published by one of 
us in 1894, and at that time* it was distinctly stated that the contour curves for 
ages of husband and wife differed widely from the Gaussian type. It is singular 
that Mr Yule in the paper we are about to discuss should have made use of two of 
the extreme types of non-Gaussian frequency with which he was very familiar when 
he was an assistant in the University College Department of Applied Mathematics, 
and yet have allowed such a statement as that of Mr Sanger’s to remain uncon- 
tradicted. The fact is that the promise made in 1894+ to deal with skew 
correlation surfaces only remained unfulfilled because the differential equations to 
the surfaces obtained in that year have so far defied integration. Undoubtedly for 
continuous variates a generalised correlation surface should be the starting-point 
for attacking the problem of association}. That it has remained unsolved shows 
only the extreme difficulty of the problem; it does not indicate that all 
statisticians before Mr Yule had “this passion for the normal curve.” 


And here we will at once emphasise the fundamental difference between 
Mr Yule and ourselves. Mr Yule, as we will indicate later, does not stop to discuss 
whether his attributes are really continuous or are discrete, or hide under discrete 
terminology true continuous variates. We see under such class-indices as ‘death’ 
or ‘recovery, ‘employment’ or ‘non-employment’ of mother, only measures of 
continuous variates—which of course are not a priori and necessarily Gaussian. 
Mr Greenwood in the discussion on Mr Yule’s paper referred to the jibe§ about 


* Phil. Trans. Vol. 186 A, p. 411. 

+ Ibid. p. 411. 

+ Such a surface, however, involves 5, 7, or more independent constants, not the three of the 
Gaussian surface, and as the fourfold table has, apart from its total, only three available constants, we 
could not hope to determine such a surface for a fourfold table without some additional knowledge or 
hypothesis, other than conveyed by the table itself, as to the nature of the frequency. 

§ ‘‘We are considering,” writes Mr Yule, “simply the performance as against the non-performance 
of the operation of vaccination. Similarly all those who have died of small-pox are equally dead: no 
one of them is more dead or less dead than another, and the dead are quite distinct from the survivors” 
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persons being ‘dead’ or ‘not dead’ and questioned whether Mr Yule was correct 
in treating the variate behind the class-index as discrete and not continuous. 
In the original paper of Pearson “strength to resist smallpox when incurred ” 
was stated to be the variate, and all the evidence that has been produced since 
indicates its continuity; in precisely the same way vaccination and non-vacci- 
nation represent degrees of immunity in a continuous variate of which area of 
vaccination as indicated by extent of cicatrix and period since vaccination are 
contributory quantitative factors. Again “employment or non-employment of the 
mother” are not taken by us as signifying the presence or absence of a mere 
discrete attribute—for example whether she works in a factory or not—but as a 
class-index indicating that employed women, who have not only their home work 
but factory labour also, have on the whole more physical exertion to endure than 
those who are simply housewives. We are really seeking how far the continuous 
variate physical exertion of women affects infant welfare, and this is not a discrete 
variate any more than survival or death of infant is a discrete variate, when you 
view them merely as class-indices of physical fitness to survive in the child. In 
other words, for the great bulk of attributes, to which Mr Yule without analysis of 
their nature applies association, we should assert continuous variation. We hold 
therefore that in the main we are applying fourfold or other class divisions to 
continuous variates. Mr Yule thinks he has freed himself from all consideration 
of what the nature of this continuity may be; we consider his belief wholly 
fallacious. You cannot free yourself from some assumption as to the nature of the 
distribution when you are dealing with the association of attributes. And in 
ignorance of what the true distribution may be, what assumption will help you to 
the most probable result ? On the basis of a very large experience of frequency 
curves and surfaces we have no hesitation in saying that up to the present time no 
distribution has been proposed which roundly represents experience so effectively 
as the Gaussian frequency. One of the present writers has indicated over and 
over again how it fails, and he has measured the significance of its failure, but has 
always recognised that he must put against this the large percentage of cases in 
which it gives reasonable results, close enough for all practical purposes*. Mr Yule 


(J. R. S. S. Vol. uxxv. p. 612). Who the ‘‘we” are, Mr Yule does not tell us; but suppose ‘‘we” 
started to find the relation between age and place in an examination—say the mathematical tripos— 
should we learn more or less by treating the wranglers as a class-mark with no graduations and age 21 
as a fixed division, or by assuming that the fourfold table: wrangler—not wrangler, minor—not minor, 
really covered continuous variations of age and class-place? Vaccination means vaccinated a week 
ago or forty years ago, a graduated immunity ; the dead means a group who not only had no power 
to resist an attack of the given intensity, but in certain of the cases an attack of a far less intensity—it 
covers a class with graduated power of resistance. Mr Yule here as elsewhere is mastered by words, not 
seeking the realities behind classification. 

* See, for example, C. D. Fawcett, Biometrika, Vol. 1. p. 443; W. R. Macdonell, Vol. 1. p. 184, 
Vol. ur. p. 227; Pearson and Lee, Vol. u. pp. 362—7; R. Pearl, Vol. 1v. p. 40; ete. ete. Compare, 
however, J. F. Tocher, Vol. v. p. 300, who for long series finds a certain amount of deviation from 
normality, generally in the direction of “leptokurtosis ” (82 is large) not of much asymmetry (8, is small). 
Eyen in these cases we doubt whether any serious practical error would be introduced by the use of the 
Gaussian distribution, unless extreme dichotomies are made. 
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with a bias which may well be called in question has gone out of his way to 
select extreme cases, which had already been indicated by one of us as markedly 
non-Gaussian, but he makes no attempt to measure the wide range of physical 
characters for which the Gaussian is a legitimate practical assumption. Mr Yule 
refers to Dr Macdonell’s memoir as a case in which the applicability of the Gaussian 
fourfold table method was “in the first place adequately tested” before adoption *. 
He leaves his uninitiated reader ignorant of two important facts, (i) that in the 
majority of fourfold classifications there is no possibility of such adequate testing 
because only the fourfold division has been provided, and (ii) the test in this case 
was directly made at the suggestion of Pearson and in his Laboratory to test the 
efficiency of the Gaussian method on ordinary data such as form probably nine-tenths 
of the frequencies which occur in practice. The work on this paper of Macdonell’s 
began almost immediately on the completion of the theoretical memoir of 1900 on 
the fourfold table, and Mr Yule’s statement that the warning of Pearson in the 
fundamental memoir of 1900 that normal correlation was not universal “seems to 
have been forgotten in a few weeks at mostt” is, as many others of his statements 
from the historical standpoint, hopelessly inaccurate. Thus the paper on eye- 
colour in man and coat-colour in horses was presented in August, 1899, and 
antedates the presentation of the theoretical paper of February, 1900. The 
“warning” could hardly have been promptly forgotten, for the paper was with- 
drawn and rewritten in order to test the value of the method of association then just 
propounded by Mr Yule, and to develop, when that was found defective, what, it was 
believed, was a better treatment. Mr Yule writes that “Professor Pearson raised no 
objection then and as far as I know has raised no objection since to my coefficient 
@; indeed he referred to ‘the extreme elegance and simplicity of Mr Yule’s 
coefficient of association.” Naturally when one finds a method wholly inadequate 
one does not turn and rend an old pupil and former colleague. What Pearson did 
do was to test Mr Yule’s Q against other similar coefficients and finding it less 
stable than any of them, it was dropped and has never been and never will be 
used in any work done under his supervision. But an interesting point arises 
here, which it is, perhaps, worth mentioning. Endeavouring to find for any 
fourfold division an analogue to Sheppard’s median division formula, 1e. for a 


Gaussian fourfold 


a | b 
e|d 
ama—b 
= COS T apm) on eehy 
Pearson hit upon the fact that the fourfold 
va | 
vibe | Vad 


* Journ, of R. S. S. Vol. uxxv. p. 63( 
+ Ibid. p. 614. 
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has the same association coefficient as 

a|b 

c|d 
and that accordingly if the coefficient of association were a valid measure of 
relationship every fourfold could be expressed in a form which led to a Gaussian 


r=sin = (eae =) 

2 \Wad + Vbe ‘ 
Thus Pearson’s Q; was a direct result of writing the fourfold in the “equalised” 
form 

vad | vbe 

vbe | Vad 
which Mr Yule now proposes as a primary virtue of his method as giving all 
classes their “natural” or equal percentages. The new “coefficient of colligation” 
is thus really an old friend, which under the form Q, did not possess “the funda- 
mentally different properties*” with which Mr Yule credits it, being the direct 


and we venture to think legitimate offspring of the equalised frequency table 
as figured above. 


The next historical point where Mr Yule seems to be at fault—at any rate in 
his criticisms of one of the present writers—is in his interpretation of the word 
correlation. He narrows it down to the significance of correlation coefficient found 
by the product-moment formula, and so obsessed is he by this idea that he applies 
it to a correlation of gross ranks, which is not a correlation of variates at all. 
The word correlation in the statisti¢al as distinguished from the biological sense, 
we believe, was first used by Galton in his memoir of 1889 entitled : “ Co-relations 
and their Measurements, chiefly from Anthropometric Datat,” and he gave a 
definition of it which does not involve the conception of the product moment or the 
linearity of regression at all. That definition was extended by one of the present 
writers in a memoir of 1895t—and it runs: “Two organs in the same individual, or 
in a connected pair of individuals, are said to be correlated, when a series of the 
first organ of a definite size being selected, the mean of the sizes of the corresponding 
second organs is found to be a function of the size of the selected first organ. If 
the mean is independent of this size, the organs are said to be non-correlated. 
Correlation is defined mathematically by any constant, or series of constants, which 
determine the above function.” It will be seen that this definition of correlation 
has nothing whatever in it that limits the use of the word ‘correlation’ to the 
coefficient of correlation as found by the product-moment method. Galton himself 
never used the product-moment method to find his “index of correlation.” He 
had generally in view§ the position that the average value of one organ or 

* Journ. of R. S. 8. Vol. uxxv. p. 592, footrote. 
+ Roual Soc. Proc. Vol. xuy. p. 135. 
“SNAG TDs 2OKs 


id to be co-related when the variation of the one is accompanied on 
.tion of the other....”? R. S. Proc. Vol. xiv. p. 135. 
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attribute associated with a given value of a second changes continuously as we 
change that given value. It was only his wide experience of anthropometric data 
which led him to believe that in most cases the function that the mean of one 
organ is for a given value of the second may be adequately represented by a 
straight line. There was nothing either in his own treatment or in the work of 
his followers of the Biometric School, which pinned them down to the use of the 
word correlation for a particular constant found by the product-moment method. 
The “correlation ratio” has in the work of that school just as much significance as 
the “correlation coefficient,” and it is only Mr Yule who proposes to confine the 
use of the word correlation to the narrow sense of straight line regression deter- 
mined by product-moment methods. To the biometrician correlation when it 
ceases to be linear is not determined by the product-moment value of 7 at all, 
and the grade of correlation may be far higher than the value as determined by 
the coefficient of correlation. In the same way other constants may be found 
defining the relationship of two continuous characters, or measuring their degree of 
dependence. These are equally measures of correlation in our sense of the word. 
Mr Yule’s coefficient of association is not in our sense of the word a measure of 
correlation at all, it shows in no manner how the mean of one attribute for a given 
value of a second attribute varies as we modify this value. It is, as we shall show 
below, impossible to give it in the case of continuous variates any rational 
significance whatever. Where there is no true correlation at all, the size of 
Mr Yule’s Q may be produced solely by a lack of homoscedasticity—of equal varia- 
tion—in the arrays of one variable associated with constant values of the second, 
but in what manner it measures this heteroscedastic property is quite beyond 
interpretation. Mr Yule claims that the nature of the frequency is of no conse- 
quence, he states that the coefficient of association may be applied without any 
general theory of frequency. For us this is not a correct attitude; we admit wide 
deviations from Gaussian distributions, but such cases are not the rule. Mr Yule 
can pick out special instances which are far from Gaussian, such as age distributions, 
barometric heights, or heterogeneous mixtures of growing organs like ivy leaf 
lengths. Even for such cases he has not examined in any adequate manner how 
far methods based on Gaussian or other allied considerations do give practical 
results, nor how far even the Gaussian fourfold »—tetrachoric 7 or 7;, we will call it 
for the purposes of this paper—is a more stable and reliable coefficient than those 
suggested by himself. 


He has indeed criticised the application of a tetrachoric 7; to eye-colour data— 
his discussion of the subject will be considered in a separate section of our reply— 
but he does not inform his readers that the one of the present writers concerned in 
the eye-colour treatment fully admitted in June 1907 * “the unsatisfactory approach 
to the Gaussian distribution found in pigmentation tables” and stated that his 
very knowledge of this point had led him to develop the method of contingency for 
such problems. Yet Mr Yule raises five years later the applicability of the 


* Biometrika, Vol. v. p. 472. 
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normal coefficient, the tetrachoric 7, to pigmentation tables without a line of 
reference to a paper, which must have been perfectly well known to him, for it is 
entitled “A Reply to Certain Criticisms of Mr G. U. Yule.” This may be strategic, 
but at the same time illustrates the peculiar character of Mr Yule’s controversial 
methods. 


(3) On the Boas-Yulean “ Theoretical Value” of the Correlation (Pearson's ?). 


We now come to a very nice point indeed, namely Mr Yule’s “theoretical 
value of the correlation coefficient,’ the method introduced into his Textbook 
of Statistics (p. 212) without a word of warning as to when it should be applied. 
Mr Yule now states that: “It would have been thought that anyone reasonably 
acquainted with the theoretical work of the last decade, and especially Professor 
Pearson and his collaborators, would have found no difficulty in the passage in 
question*.” Now what exactly have Pearson and his collaborators done? They 
have applied the product-moment correlation to the presence of 0, 1, or 2 
protogenic wnits in theoretical Mendelian investigations. They have assumed that 
when a character goes by wnits, you may apply the usual product-moment methods. 
But they have objected in toto to the application of such a method to material 
where there was reasonable evidence of continuous variation. Does Mr Yule look 
upon ‘death’ as the addition of one unit to ‘recovery’? Does Mr Yule look upon 
vaccination as the addition of one unit to ‘absence of vaccination’? Does Mr Yule 
look upon ‘mental defect’ as the addition of one unit to normal mentality? In the 
three Mendelian types (RR), (DR), and (DD) there is a progression of one unit at 
each stage in the number of D’s, but what are the units in the cases we have 
mentioned? Or, does Mr Yule suggest that his “theoretical value of the correla- 
tion” is to be confined to those actual true unit additions to which Pearson and his 
collaborators have always confined them? There is not a hint of this in his Text- 
book nor in his present paper. He has indeed carefully refrained hitherto from 
saying what are the characters of the attributes to which it is to be applied. He 
has suggested that “the ordinary theory of correlation, once that theory had 
been freed from any necessary relation to the theory of normal correlation, was 
applicable in its entirety to the 2 x 2-fold tablet.” Mr Yule says that this should 
be “a very obvious matter.” Indeed !—Then apparently ‘vaccination’ and ‘ mental 
defect’ are a quantitative unit more than non-vaccination and normal mentality !— 
But how does this fit with Mr Yule’s other assertion that these are discrete attri- 
butes and suitable for the application of his coefficient of association? Let us 
see how Dr Boas investigates Pearson’s 1,= $f. 


“Correlations of phenomena that cannot be measured but only counted may 
be treated in the following manner: If two events that have the probabilities 
p, and p, are correlated, we may say that those cases in which the event 1 occurs 

* Journal of R. S. S. Vol. xuv. p. 609. 


+ Journal of R. 8S. S. Vol. xtv. p. 606. 
+ Science, May 1, 1909, p. 824. 
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have the probability 1, or a deviation from the normal probability 1—p. Those 
cases in which the event 1 does not occur have the probability 0, or the deviation 
from the average probability of — p,.” 


Pearson at the time read this many times through ; both of the present writers 
have read and re-read it since, and they fail utterly to grasp how an event can 
have at the same time a probability of 1 and of 0 and a normal probability 
of 1—p! Mr Yule says that Professor Pearson failed “to understand what Dr Boas 
was doing*.” We still fail completely to understand what Dr Boas means, or 
how Mr Yule justifies the assertion that Dr Boas was demonstrating a formula 
which applied to two values of a character differing by a unit. What did come 
out of Dr Boas’ investigation of 1909 when it was translated into the fourfold 
table terminology 


a+b 


a | b 
c d c+d 


ate | b4-d N 


ee 
~ V(b+d)(a+c)(c+d)(a +b) 


was Uh 


> 


a value already known (1) as the correlation 7, between the means, each measured 
in terms of their standard deviations, of two variates of a fourfold Gaussian table+ 
or (11) as the square root of the mean square contingency of a fourfold table 
without any assumption of a Gaussian distribution f. 


Now it is known that the correlation of errors in two means, 77, 18 equal to the 
correlation of deviations, 7,,, in the two variates of which @ and ¥ are the means. 
But rp, 18 not rz, for h=Z/o, and k=7/o, and o, and cy, are correlated as well as 
xand 7. It is therefore clear that if # and y are continuous variables of any kind, 
Tre 1S not a “ theoretical value of the correlation” of variates. It is a correlation of 
ranks where the ranking consists of only first and second, and is wholly uncorrected 
for class-index. It becomes a true value of the correlation when the two classes 
differ by a unit quantity as in the units of theoretical Mendelism. There is not a 
word of this in Dr Boas’ paper; he speaks of his formula as applicable when things 
can be counted but not measured. Mr Yule speaks of it as applicable in its 
entirety to the 2x 2-fold table!—When pressed he says it would have been 
thought that no one acquainted with the work of the Biometric School—on 
Mendelism—could fail to understand what his passage signified. We wholly fail 
to understand it now. Is 7;,, Le. ¢, applicable to every fourfold division? If so, 
why does not Mr Yule use it and drop his coefficient of association ?—In truth he 


* Journal of R. S. 8S. Vol. xiv. p. 608. 
+ Phil. Trans. Vol. 195 A (1900), p. 12. 
{ Drapers’ Company Research Memoirs, ‘‘ On the Theory of Contingency,” 1904, p. 21. 
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cannot do it without confessing himself hopelessly in error!*—If $ or rpz is the 
right thing to use for a fourfold table as Mr Yule now suggests, then his w and his 
Q are hopelessly wrong, for all the selections which do not alter his Q—and the fact 
that it is not altered by selection is according to him one of its great merits—do 
alter, and this to any extent we please, his “theoretical value of the correlation.” 
This very point was emphasised by one of the present writers in a paper recently 
published in Biometrikat but Mr Yule appears wholly to have missed the essential 
features of that criticism. The poimts were that (i) there was a wide range of 
values of @—Mr Yule’s coefficient of association—for a surface of the same correla- 
tion ; if it is impossible to compare the values of Q for the same surface divided at 
different places with any intelligible result, what possible comparison can be made 
of Q from one system to a second ?, and (ii) the values of Q and r;, or ¢ are wholly 
different and tend in opposite directions as we change our divisions. Under 
“wholly different” we include liability to be “wholly differently interpreted.” 
Both Q and ¢@ range numerically between 0 and 1. Therefore in estimating the 
meaning of Q and @¢ or 7, we have to consider where they stand on this range. 
Mr Yule examining the associations between developmental defects, nerve signs, 
low nutrition and mental dulness finds values ranging from ‘75 to ‘95 for his 
coefficient QY. He comments: “The associations are, however, all high (very high 
compared with most coefficients of organic correlation with which one has to dealf), 
ranging from “784 (? °750) to 9526.” Elsewhere in the same paper Mr Yule 
speaks of ‘174 as a “very small association,” and a‘8 to ‘9 association “as very 
high indeed ||.”._ We know accordingly what Mr Yule understands by high and low 
association. Indeed if a scale of values is to lie between 0 and 1, those approaching 
0 must be very low and those approaching 1 must be very high. Now Heron 
applied Mr Yule’s or Dr Boas’ “ theoretical coefficient” to precisely the same data 
as those for which Mr Yule had calculated his association Q, and found that it was 
very high. Heron found that for Q=-921 and ‘753, the “theoretical value of 7” 
=°011 and -006 respectively. If both these ways of investigating relationship are 
valid, then ‘011 and 006 must on a correlation scale represent a “ very high degree 
of association.” It would be interesting to know how Mr Yule would describe 
$= °'95 or what represents a low association, if ‘01 corresponds to a high degree of 
association !—But any one who is familiar with coefficients of correlation—and ¢ 
or 7,4 18 a real coefficient of correlation,—knows that values of ‘01 and under are 
extremely low values and, whatever their probable errors may be, are of no 
significance for purposes of prediction. All Mr Yule can say in reply to Heron’s 
statement that one of Mr Yule’s methods gives very high relationship and the 

* See also pp. 172-4 below, where this point is touched on again. 

+ <The Danger of Certain Formulae suggested as substitutes for the Correlation Coefficient,” 
by David Heron, Vol. vii. p. 109. 

+ Here and at other points of his earlier papers, Mr Yule apparently considers that @ is really 
comparable with the true correlation. 

§ Phil. Trans. Vol. 194 A, p. 300. See also ‘‘high degree of association,” Theory of Statistics, 
p. 34. 

|| Ibid. pp. 289 and 296. 
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other very low relationship between the attributes under investigation, is that 
“these small figures represent perfectly appreciable intensities of the product sum 
correlation r.” But this is an entirely different matter *, the intensity of correla- 
tion on the scale 0 to 1 is quite distinct from its degree of reliability. Mr Yule 
would have termed ‘95 a very high association had it been deduced from 1000 cases, 
and we term ‘01 a very low correlation even when deduced from a census population. 
Here are the results for which we have calculated the probable errorst. 


* Mr Yule himself says (loc. cit. p. 651) that the statistician must always keep apart the magnitude 
of the association and the reliability of this magnitude. We doubt the truth of this statement, but it 
cuts against his own argument at this point! 

+ The probable error of a coefficient of correlation for any frequency distribution is given by the 
formula : 


"674497 | Pop Peo, Pio , Por _ Pat Pi3 \8 


P.E. of r="674490,= Si Tete ie oes 
n 


= 9 oe 2 
Pi? 2p20Po2 4p207  4P02” PuP20 P11Po2 
(Pearson, Drapers’ Research Memoirs, ‘‘ General Theory of Skew Correlation,”’ p. 20). 


The whole work therefore of finding o, turns on evaluating pij1, P22, P31, Pi3, Po and pog for a 
fourfold table : 


a b a+b 


c d c+da 


at+c | b+d n 


when we suppose concentration at single points of the frequencies. 
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Relationship between Blindness and Mental Defect 
for different Age Groups. . 


73 ; 
Age Group eeu i bi sae : 

j— 70113 + :0030 
10— ‘0100 + :0022 
15— ‘0065 + ‘0017 
20— ‘0061 + :0015 
Bi -0046 + 0009 
B— “0060 + 0010 
45— *0053 + ‘0008 
00 ‘0059 + :0012 
65— *0028 + ‘0012 
To— — 0031 + 0014 
85— ‘0058 + °0065 
All ages 5—85 ‘0066 + :0002 


Is it conceivable that, if Mr Yule had approached the problem of the relation- 
ship of blindness and mental defect from the standpoint of “the theoretical value” 
of r and found that the maximum value of the coefficients obtained was ‘0113, 
there would have been any talk of the high association of the two attributes? If 
this on the Boas-Yulean scale means high association, what language would Mr Yule 
find to describe a Boas-Yulean coefficient of (96? Instead of replying to this criticism 
of Heron, Mr Yule states that this series of values confirms his view that the 
association decreases with age! If the reader looks at the diagram below, he will 
see the mean value of the Boas-Yulean coefficient with twice the probable error 
of each age sample set off either side of it. He will note (i) that there is only one 
age 75—80 where the deviation from the average value becomes significant ; 
(ii) that from age 15 to age 60, the polygon is practically horizontal and agrees 
with the mean; (111) the “high” values (01 order!) occur in childhood, especially 
ages 5—10, where diagnosis of mental defect is doubtful, and the low values at 65 
and onwards (where they are even contradicted by age 85 onwards !), just the ages 
when senile decay and old age cataract may lead the recorder of a census return to 
almost any statement as to mental derangement and blindness. We feel fairly 
confident that the unbiased statistician with this result before him could only con- 
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This form agrees with that obtained by Yule and verified by Greenwood, but our deduction of it 
appears to be the natural method and shows its relation to the general formula for the probable error 
of r. 
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clude that the Boas-Yulean method indicated no practically important relationship 
between mental derangement and blindness and that there was no trustworthy sign 
of any modification of this relationship with age. The point is not whether such 
a relationship really exists or not, but that one method advocated by Mr Yule 
shows definite results (high association), although a second, equally strongly 
advocated, fails to give any association of practical value and is thus in direct 
opposition to the first*. Mr Yule has omitted to indicate which method is the 
proper one to use in such cases. Are both right or both wrong? One method 
Mr Yule states may be applied to all 2 x 2-fold tables, the other should be used 
for “discrete” quantities. When is a quantity “discrete”? Mr Yule confuses 
“discreteness” in the class-index—a mere verbalism—with discreteness in the 
attribute classified under it, and this reduces his investigations from the plane 
of practical statistics to the field where we originally placed them, that of 
theoretical logic. 


Diacram I. Diagram showing absence of any relationship of practical value between 
Blindness and Mental Defect. 


Blindness and Mental Defect. 


Average. 


Coefficient of Correlation. 


aegis mslae alias opilaer ee ee ee = a a 
§ 10 15 20 25 35 A5 55 65 75 85 95 


(4) On Association and the Boas-Yulean Coefficient. 


Again we reach an interesting point which Mr Yule has failed to elucidate. 
It is best illustrated by an example. Take the following table for lengths of ivy 
leaves on the same spray—one of the examples selected by Mr Yule: 


* See also p. 204 below. 
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a Length of First Leaf. 
s Under 6:95 mm. Over 6:95 mm. Totals 
S eee 
D 
sao Under 14:95 mm. ... 6,943 46,673 53,616 
Oo | Over 77:95mm:; ~~: 41 6,343 6,384. 
aH 
“ Se SESS 
ae 
3 Motalsuage Wes 6,984 53,016 60,000 


The true correlation of the material is ‘5672 and the tetrachoric r= '5572 + 0058. 


It may be presented in any one of the three forms (i) as A above, (ii) in one of 
Mr Yule’s “natural forms *,” e.g. 


B. 6,636 | 1,383 | 8,019 


1,383 | 6,636 | 8,019 


8,019 | 8,019 | 16,038 


Or, again, as 


C. 6,943 4:6673|  6,947°6673 


410,000 | 6,343 416,343 


416,943 | 6347-6673 | 423,290°6673 


Now Q is the same = ‘9167, “very high association” for all these forms, and 
a posteriort given Q we should not know whether it had arisen from A, B or C. 
The values of the Boas-Yulean coefficient for these tables are: 


A, $="1183, 
B. $ =°6551, 
C. b= 0152. 


Which of these values are we to adopt? Are we to use the table as it originally 
is given? Or the “natural” form B? Or the selected form C? How can two 
coefficients ever lead to the same results, when an adjusting process which does 
not modify one, a property claimed as one of its advantages, changes the other to 
any value we please from zero up to Mr Yule’s new coefficient of colligation t ? 

* See Journal R. Stat. Soc. Vol. uxxv. p. 590. 
+ Taking the usual form of the fourfold : 


a b a+b 


c ad c+d 


ate | b+d n 
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Let us examine the effect of “adjusting” the tables in Mr Yule’s cases of 
vaccination and death at Sheffield, Leicester and Homerton-Fulham. He asserts 
that it is “natural” to take 50°/, of vaccinated; we fail to understand why 50°/, 
is more or less “natural” than 70°/, or 95°/, or than the percentage which actually 
occurs in the smallpox cases of those towns. Even if he takes 50°/, of vaccinated, 
why it should be “natural” to take 50°/, of deaths also is to us equally mysterious, 
and we believe must be to that juryman “any man of ordinary intelligence” to 
whom Mr Yule appeals. The following table gives the values that would arise 
from different methods of “adjusting” the tables: 


Smallpox—Vaccination and Death. 


Yule’s 
Association Boas-Yulean 1, C, iP 
(a) (0) (c) | @) (e) 

Percentage of Deaths... 50 °/, 50°/, Actual) Actual) Actual| Actual} Actual) Actual 
Percentage of Vaccinations 50 °/, 50°/,| 70°/, | Actual) 95°/, 

Sheffield 902 6380 | °531 479 "383 “769 "432 «| 5:15/10284 
Leicester ont 862 572 | °249 233 190 ‘611 ‘187 | 2°46/101 
Homerton-Fulham 804 504 | *423 “409 084 662 379 | 1:22/1044 


It is clear that the Boas-Yulean method will give any results whatever between 
zero and those in the (6) column, according to what percentage we choose to take 
in the adjusted table of deaths and vaccinations. We can also change the perfectly 
arbitrary order that Mr Yule has given for the three towns. It appears to us 
that his statement that “it should have been an obvious matter that the 
ordinary theory of correlation, once that theory had been freed from any 
necessary relation to the theory of normal correlation, was applicable in its 


modify it by a Yulean selection using the factors J and m into 
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@ remains unchanged. But ¢ now takes the value 
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=w, the coefficient of colligation. 
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entirety to the 2 x 2-fold table” is wholly inconsistent with any validity in the 


coefficient of association. 


It is not only that we can give a vast range of values to ¢ for a constant Q, but 
equally we can give @ a whole range of values, starting at the symmetrical table 
value and proceeding up to unity for a constant ¢. Examine for example the 


following series of fourfold tables. 


The first is the table as it actually occurred 


with Q =°5078, the second is Mr Yule’s “equivalent” symmetrical table with of 


course the same Q. 


(1) 29600 
37200 


17300 
66600 


46900 
103800 


66800 | 83900 | 150700 
Q="5078, ¢="2542. 


() rises. 

(3) 45452 | 47103 | 92555 
12693 | 45452 | 58145 
58145 | 92555 | 150700 

Q= 5511, = 2728. 

(5) 39852 | 66467 | 106319 

4529 | 39852 | 44381 
44381 | 106319 | 150700 
QO = 6313) Oi—- 228. 
(7) 35852 | 77349 | 113201 
1647 | 35852 | 37499 
37499 | 113201 | 150700 
Q= 8196,  =-2728. 
(9) 32952 | 84541 | 117493 
255 | 32952 | 33207 
33207 | 117493 | 150700 


Q= 9611, = 2728. 


(11) 32352 | 85974 | 118326 
22 | 32352 | 32374 
32374 | 118326 | 150700 
Q='9964,  =:2728. 
(13) 32302 | 86095 | 118397 
1 | 32302 | 32303 
32303 | 118397 | 150700 


Q= 9998, $= 2728. 


(2) 47952 
27398 


27398 
47952 


75350 
75350 


75350 | 75350 | 150700 


Q="5078, ¢= 2728: 
We now proceed to adjust the latter table so that @ remains stationary and 


(4) 43052 | 56385 | 99437 
8211 | 43052 | 51963 
51263 | 99437 | 150700 


Q = 6003, $= :2728. 


(6) 37952 | 71808 
2988 | 37952 
40940 | 109760 


109760 
40940 


150700 


Q='7407,  ='2728. 


(8) ~ 33552 | 83090 | 116642 
506 | 33552 | 34058 
34058 | 116642 | 150700 
Q = 9288,  =°2728. 
(10) 32552 | 85499 | 118051 
97 | 32552 | 32649 
32649 | 118051 | 150700 
Q=-9845, = 2728. 
(12) 32327 | 86035 | 118362 
11 | 32327 | 32338 
32338 | 118362 | 150700 
Q= "9982, ob =:2728. 
(14) 32298 | 86104 | 118402 
0 | 32298 | 32298 
32298 | 118402 | 150700 


| 


Q=10000, $=-2728. 
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The two series of results just considered show us that with ¢ constant we can 
make @ rise to unity and with Q constant we can make ¢ fall to zero. It is there- 
fore always possible to determine or to find in actual practice series of tables in 
which an ascending order of @ is accompanied by a descending order of @. That 
is to say the two coefticients will flatly contradict each other. There is no basis 
whatever for Mr Yule’s assertion that in Sheffield the correlation of vaccination 
and recovery is highest, while in Homerton and Fulham it is lowest. With the 
same three values of Q that he gives, we can make any order of the ¢’s we please. 
In the actual tables the relative order is not that given by Mr Yule; the order of 
tetrachoric r agrees with that of @; so does the coefficient of mean square con- 
tingency and further the probability of independence, i.e. the probability that 
death is independent of vaccination. These are given as 7, C, and P in the 
last three columns of the table on p. 173, and they entirely reverse Mr Yule’s 
judgment. P is of course in a different class to any other of the coefficients, 
but we return to this point later. 


Mr Yule states that any table may be reduced without modification of the 
association to its equalised form; for example the tables 


998,667 666 | 999,333 and 300,000 | 200,000| 500,000 
666 1 667 200,000 | 300,000} 500,000 


999,333 | 667 /1,000,000 500,000 | 500,000 } 1,000,000 


are “equivalent,” both have Q ="385, but the inferences to be drawn from the two 
tables had they originated independently are quite different. In the first case the 
probable error of Q=‘288, and the result is not definitely significant. In the 
second case the probable error is ‘001 and there is no doubt of the significance. 
If Q_ be the Q of an original table and Q, of the equalised table, then we have 
Mr Yule’s vaccination data : 


p.E. of Qy p. E. of Oz 
: | 
Sheffield es Ane ‘007 | 003 
Leicester ae. wee 065 | 022 
Homerton and Fulham ‘O07 | “O05 
| 


Mr Yule has not entered into this question of the probable errors of the series 
of his modified tables. The statistician, however, whose long experience enables 
him closely to associate given types of tables with given degrees of reliability, 
is largely deceived when an “equivalent table” is presented to him of which the 
probable error as it stands may be } to ;1, or less of the true probable error of 
the coefficient given. 

We have shown in this section of our discussion that the two coefficients Q and 


¢ cannot both be valid. Mr Yule nowhere adequately assigns the type of cases to 
which the one or the other should be applied. He tells us that he hopes the 
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“comedy of errors” has now ended, that comedy consisting in our overlooking the 
“fact” that it is “a very obvious matter” that ¢ is applicable in its entirety to the 
2 x 2-fold table. That applicability is the very point to which Pearson took ex- 
ception in Dr Boas’ use of ¢. Were it a fact, Mr Yule might throw both his Q 
and his w overboard, for what more is required than a method applicable in its 
entirety to a 2 x 2-fold table? And the @ method at all points contradicts the 
Yulean results. But it is not true, for at best it would be a method of ranks, and 
the correlation of ranks is never a correlation of variates unless the ranked 
quantities proceed by absolute units of variation—as for example in the theoretical 
Mendelian case to which Pearson perfectly legitimately applied it—or in counting 
the teeth on the carapace of a prawn or the veins on a leaf. 

Mr Yule seems to consider that the inoculation with an antitoxin is equivalent 
to the addition of a unit of something to the individual; we consider that this is 
wholly erroneous. ‘lo begin with, the dosage is not uniform, its repetition does not 
always occur at the same interval and the number of doses is not always the same ; 
further the interval between the onset of the disease and first inoculation is by no 
means the same; lastly, apart from the resistance of the individual patients to the 
disease, the curative effect of the treatment depends on the relation of the antitoxin 
administered to the physiological individuality of the patient. It is idle therefore 
to consider this varying complex as a quantity undifferentiated from individual to 
individual. The group treated with antitoxin is not made up of identical indi- 
viduals but of a number of persons with increased power of resistance to the disease, 
which may vary from the case of a person who has gained nothing by it to that of 
a person who has immensely increased his power of recovery. In precisely the 
same way those who have not been treated can by no means be grouped into 
a single quantitative class; it may be doubted, indeed, whether recuperative 
power when disease is incurred is really divided sharply by a line like treatment 
or non-treatment with antitoxin. It may be only the sharpest division we can 
take under the circumstances, and in our ignorance of the nature of the distribution 
a tetrachoric 7, may be as effective a measure of association as Mr Yule’s Q. At 
any rate no man of “ordinary intelligence” would believe that perfect association 
existed between treatment and recovery because out of 23 persons treated none 
died, while out of 977 not treated six died, yet this would be the result provided by 
Mr Yule’s coefficient of association! Clearly if only 0°6°/, died without treatment, 
we should not expect any to die in a sample of 23 whether treated or not treated. 
The vanishing of association for a zero quadrant is a patent fallacy *. 

If the problem were presented to us as Mr Yule states it, Le. the evaluation of 
a new treatment, we should certainly not use his Q for solving the problem. We 
should probably to-day not use a tetrachoric 7;, except as a control. We should 
most likely use ¢, although certainly not in the sense of a correlation of four points. 
We should question how far death and recovery are independent of treatment or non- 
treatment; that is to say, we should ask what is the probability that recovery 


* This topic is of such importance that we have discussed it more at length in Appendix I. 
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was independent of treatment. If n be the total number of observations, then, if 
ng? = y* be calculated, the chance of independence is at once given by finding the 
P corresponding to y? from the tables for “goodness of fit” im the case of n= 4. 
The column of P in the table of p. 173 shows us that the Homerton-Fulham data 
stand at the head of the series in this respect. This is of course in great part due 
to the larger numbers dealt with, but obviously in such a question as the effective- 
ness of treatment weight must be given to numbers. 


If we turn to the problem of 7 and C, considered merely as coefficients of 
association, we must examine what Mr Yule has laid down as the desiderata of such 


a coefficient. The fourfold table being : - he has asswmed that the coefficient 
must range from — 1 to +1, and that if any one of the cells be empty the coefficient 
id — be Vad — /be 

must be +1 or —1. He then guesses a formula ““—"" or another —“"—~"" 
S ad + be Vad + Vbe 


out of the many thousands which can be invented*. 


But is either of the assumptions above really necessary? Why should the 
association be perfect when b=0? Why should it be perfect even if both b and c 
are zero? Let us toss a shilling and a penny and record heads or tails of both. 


Shilling. 
| Head | Tail | Totals 
os Head es 1 (0) 1 
(=| 
q 
o 
Ay Tail ay 2 1 3 

| 
Totals... 3 1 4 


We do it four times and the result is as above, on the whole not a very improbable 
result. But according to Mr Yule there is absolute association, and since the 
probable error according to him is zero, the result is absolutely reliable. Clearly 

* See Pearson, Phil. Trans. Vol. 195 A, p. 15. If ¢(z) be a function which vanishes with z, 


then any form 


1b (ad) — (be) }/{ (ad) + (be) } 


satisfies Yule’s requirements. Or, we can take a form 


if @(k) be finite for k=a, where x=(hc)/(ad). Clearly for the range 0 to © of values of x, Qx ranges 
from +1 to —1 and satisfies Mr Yule’s conditions if #(») is > (x), but by an arbitrary choice of ¢ we 
can get any form of Q-curve we please. No curve of real significance can be obtained, i.e. no reason- 
able value of an association coefficient by the simple condition of fixing three of its values without 
other hypothesis ! 


Biometrika 1x 23 
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@ would be better than this, for it is equal to ‘35 with a probable error of ‘15 
which makes it for practical purposes unreliable. Or again, we repeat our experi- 
ment and find 


Shilling. 
Head Tail Totals | 
= Head aa 2 | (0) 2 
gS | 
Ss 
o 
o Tail eects BO 2 2 
| | 
Totals ... 7) 2 4 


Here @ is again unity or there is absolute association, and the probable 
error is zero or this association according to Mr Yule is also absolutely reliable. 
Further ¢=1, and its apparent probable error is zero, but this is only apparent 
because the calculation of the probable error (as indeed of that of Mr Yule’s Q, 
although he has not noticed it) is incorrect for such a case. 


These simple illustrations seem to indicate that there is nothing in the nature of 
things which necessarily demands that the association shall be unity when either 
b or c or both are zero. On the other hand the probability P (as derived from @, 
the mean square contingency) that the heads and tails of shilling and penny are 
independent is more than ‘90, and in the second case more than ‘25. I% seems to 
us therefore that this mean square contingency method which gives reasonably 
satisfactory results, where the Boas-Yulean goes hopelessly astray, is far more 
likely to be preferable in the case of medical treatment to which Mr Yule proposes 
to apply his coefficient. 


There is another point also to be considered. Why should the range of a 
good coefficient of association lie for any given number of cells between +1 and 
—1? Let us examine the following table : 


(A) | 


Totals 


40 
360 
570 

30 

30 
570 
360 

40 


If we treat each sub-range here as unity we find the correlation negative and 


equal to — 1120. If therefore we assume this to be the ultimate distribution, this 
is the correlation coefficient. 


Kari PkARSON AND Davin Hrron 179 


Now combine 6 and c,h and 7%. Proceeding in the same way to find the Yulean 
pseudo-ranks 7, we have now for the following table : 


(3) a | b+e d e | VF g h+i j. {| Totals 
a 40 Os eon th hoe No ol) 0 0 40 
B 0 | 200 0 0 10 | 110 | 40 0 | 360 
y 0 0 0 0 10 | 440 | 120 0 | 570 
5 0 0 0 0 10 | 20 0 0 30 
C 0 0 | 20 10 0 0 0 0 30 
¢ 0 | 120 | 440 10° || 6 0 0 0 | 570 
7 Oy |) 40 4, L10 10 0 0 | 200 0 | 360 
6 ) 0 0 0 0 | oO} 0 | 40 40 

\ 
Totals] 40 | 360 | 570 30 | 30 


the value is positive and equal to + 0050. 


Now club d and e, f and g, y and 6, ¢ and € together, and we have 


(C) | a b+e | d+e | f+9 | htt j Totals 

a 40 | 0 0 ) 0 0 40 

O | 200 O 120 40 0 360 

yt6 (0) O O 480 120 O 600 

e+¢ (0) 120 | 480 0 10) 0) 600 

0 40 120 0) 200 O 360 

0 Or OF) 0 0 | 40 40 

Totals 40 360 600 600 360 40 


The Yulean pseudo-ranks r is now +°2562. 


Combine a and b+, h+7 and j, a and B, 7 and 6, and we find 


«D) atbt+e | d+e| f+g | h+i¢+j ] Totals 
a+B 240 O-) 120") 40 400 

y+6 0 0 | 480 | 120 600 

ea 120 480 O O 600 

n+6 40 120 0 | 240 400 

Totals 400 600 600 400 2000 


The Yulean now drops down to +1429! 
Combine d+e and f+ g, y+ 6 and e+ € to give 
CD 13 


d+e+f+g h+itj Totals 


Totals 


The Yulean is now +°5000. 
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Combine b+c,d+e, f+g, h+% together; 8, y+ 6, €+6€, 7, and we deduce 
(F) | 


a | b+etdtet+f+gthti | j | Totals 


oe 40 ) 0 40 
| Btytbteblty ) 1920 0 1920 
= 0 40 

Totals 2000 


The Yulean has now reached perfect correlation, or 7 = + 1:0000. 


Combine a, b, c, d, e together and f, g, h, i, 7; a, B,y, 6 and e, 7, 0, and we 
have 


(G) | oto+e+ate | ftgth+ity | Totals | 
at+Bty+e 240 760 1000 | 
e+C +46 760 240 1000 


Totals 1000 | 1000 2000 


The Yulean is now negative and —*5200. 


But if we write the table as a fourfold thus: 


(H) atb+c+dt+e+ft+g+ht+i ] Totals | 
atB+ytd+et+¢+n 1960 
6 0 40 
2000 


the Yulean would be again unity and mark perfect correlation of a positive kind. 


Mr Yule’s coefficient of association would also be positive and mark perfect 
association. 


Now it is not open to Mr Yule to turn round and assert that such tables are 
extremely unlikely in practical statistics, first because his condemnation of the 
coefficient of contingency is based solely on the creation of an artificial table 
in exactly the same way, and secondly because he asserts that once we dismiss 
the idea of Gaussian frequency the method of correlating ranks with big ‘ brackets’ 
becomes applicable. Our tables bring out, however, three important points: 
(i) that two variates with an actual correlation of —‘112 may exhibit any corre- 
lation between — 52 and +1:00 when treated by the Yulean process of pseudo- 
ranks ; (11) that Mr Yule’s coefficient of association may cover under the heading 
‘perfect association’ almost any value of the real relationship, it is merely an 
association of common names, i.e. class-indices, and not of the real variate beneath 
these class-indices; and (iii) that the assumption that a fitting measure of 


association should give unity for a fourfold table of the form 5 | . is by no 
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means obvious. It is conceivable that a better measure of association would give 
a limit below unity for such a case, while providing a limit nearer and nearer to 
unity as the information given with regard to what really occurs inside the broad 
categories is more and more complete. That is to say, a desirable coefficient of 
association would be one which would always lie numerically between 0 and 1, 
but which would not take the absolute value 1, unless far more detailed infor- 
mation were provided than is given in the statement of such a table as (H). 
From this standpoint we see at once how idle is Mr Yule’s criticism of the co- 
efficient of contingency. He suggests that it is invalid because (1) it has an upper 
limit less than unity, when the contingency table has a limited number of cells, 
and (11) its value rises when you increase the number of cells. It is less than unity 
in the first case, because we are ignorant of what may happen when we analyse 
the contents of the big cells; it increases in the second case because we have 
additional knowledge. It only becomes unity when one character A is absolutely 
fixed by a second B, i.e. when A is a function of B*. The coefficient of con- 
tingency is a valid measure of association, whether the table be fourfold+ or 
nx n'-fold. It presents far fewer logical anomalies than Mr Yule’s Q or the Boas- 
Yulean ¢, and it readily admits of our calculating, what for many cases is essential, 
the probability that the two attributes are independent. 


But Mr Yule dismisses the coefficient with (a) a quite unreasoned criticism 
that it increases in value as the number of cells increases, and (6) an illustration 
that it is not equal to the coefficient of correlation for one particular table of 
heterogeneous material, i.e. for a surface of zero correlation with a cock’s comb 
of absolute correlation erected along its diagonal. Nobody, as far as we are aware, 
ever asserted it would be. The assertions made with regard to the coefficient of 
mean square contingency may be summed up as follows: (i) for any frequency 
distribution the coefficient of contingency is a reasonable measure of the extent 
of the deviation of the attributes from absolute independence, and (11) for such 
frequency surfaces in homogeneous material as occur in actual practice the co- 
efficient of contingency, if the proper corrections are made}, gives a value close to 
the coefficient of correlation, whether we divide the table up into 3 x 3-fold or 
8 x 8-fold groupings. The skewness of the distribution—its deviation from Gaussian 
frequency—is not a very disturbing factor, as we shall show in the sequel. When we 
take material which has—if there be an indefinite number of cells—an indefinitely 
great improbability of independence, i.e. material for which C,=1, we shall not 


* See Pearson, Grammar of Science, 3rd ed. p. 162. 
+ The probable error of a coefficient of contingency C2 for a fourfold table is 
1- 0,2 Co (1-40), 
Jn (1 — C42) 

where \ and yu have the same values as on p. 170. It does not become zero when the Boas-Yulean # 
is equal to unity, unless \=u=0. 

+ These corrections have been several times referred to (see Grammar of Science, ed. 1911, ftn. 
p. 163) and have been in use in the Laboratory, but the further memoir on Contingency which has been 
for some time in hand has been delayed owing to pressure of other work. It will shortly be issued and 
deal more fully with the corrections merely stated in this paper. 


*67449 
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reduce this infinite improbability by diluting it with a finite amount of non- 
correlated material; and this approach of C, to unity is all that Mr Yule’s artificial 
cock’s comb surface illustrates. If A is an infinitely improbable event, then we 
shall not lessen the improbability of the whole by combining the event A with B 
which has a zero improbability * ! 


Mr Yule cannot determine the efficiency of contingency methods by simply 
asserting that the value of the mean square contingency depends on the number of 
cells; it naturally alters with our increased knowledge, but. this change may mark 
either an increase or a decrease according to the manner in which the material in 
the few cells is redistributed in their component cells. We assert that with a 
3 x 8-fold table you cannot get further than the contingency, and that further pro- 
gress can only be made by some other assumption as to the frequency distribution 
of the variates. With that assumption we think we shall be able to demonstrate in 
the course of this paper to the unprejudiced reader that the coefficient of contingency 
properly handled is, perhaps, the most powerful instrument of modern statistical 
theory. The assumption we make is that for correcting the results obtained by con- 
tingency, so that coefficients found for 3 x 3-fold, 4.x 4-fold, 5 x 5-fold, ..., 8 x 8-fold 
tables may give practically identical results, it is sufficient to deduce the required 
corrections by using a Gaussian hypothesis to determine certain means. The method 
gives excellent results for the bulk of the distributions which occur in our wide 
experience of statistical work. If we can show that it gives good results for the 
extremely skew cases which Mr Yule has gone out of his way to cite, our point 
will be proved. Since the full development of the contingency method, fourfold 
tables have not been used by the Biometric School, except as controls, where 
contingency tables could be formed on the given data. But the statement that 
contingency was developed in order to overcome the difficulties of the fourfold 
table methods is directly disregarded by Mr Yule when he turns to our pigmenta- 

* A is the probability that in examining two absolutely independent variates, n cells shall be 
occupied and n?—n cells empty when we make n infinite. B is the probability that the n? cells shall 
each have their theoretically independent contents. No combination of these two events can give less 
than an indefinitely great improbability, i.e. Cp=1. But we anticipate that if Mr Yule had not raised 
his cock’s comb at such a conspicuous angle to the rest of his surface that its heterogeneity would 
be readily visible to the trained statistician, there would be no very serious error introduced by 
applying the mean square coefficient of contingency even to moderately heterogeneous material. We 
have not had leisure to investigate the matter closely, but if we superpose two Gaussian frequency- 


surfaces with identical means, with the same standard deviations for both variables, and with cor- 
relations 7; and 72, then the true correlation by product moment is 
pP=Prit+ qr, 
where pN and qN are the total frequencies of the two components and p+q=1. 
On the other hand 


Co= Ny) p2(L+ 712) — 2pr1 12 (M1 +72) +1127? (L+7172) 
i pe (L+11 72) — 2pryre (7 +19) Frere? (L+ 772) + (1 = 712) (l= 792) (1-179) ° 
If 7; =1, r2=0, as we know Cy=1, while p=p. But if 7 =-2, m="7, with p=°3, g=°7, the mixture pro- 
portions of Mr Yule’s illustration, then p=°55 and C)="59. If 7)=°5, re="7, p="3, q="7, then p=-64 
and C2.=°65. Again if 7)=°7, r2='3, p= "4, g="6, then p='46 and C,="49. Thus it does not appear 
that small amounts of heterogeneity, not detectable on a study of the table, are likely to give very mis- 
leading values when C2 is taken as a practical measure of p. 
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tion data*. The recognition, however, that the fourfold table may give discordant 
results—a recognition made by the Biometric School within four years of the 
publication of the pigmentation investigations for eye-colour in man and coat- 
colour in horses—does not dismiss the fourfold table from practical statistics, but 
only from that portion of it where multiple contingency tables are available. 
Given a fourfold classification alone, how is it to be treated? We reply unhesi- 
tatingly that in the great bulk of cases the use of tetrachoric 7; is the best treat- 
ment. We base this on the experience that where nothing is known the Gaussian 
is far more likely to describe approximately the frequency than any other hypothesis. 
Even taken as a mere coefficient of association, tetrachoric 7, is better than 
Mr Yule’s Q or the Boas-Yulean ¢, except for absolutely discrete units as in 
the purely theoretical Mendelian cases ; and in those cases the correlation of ranks 
is the correlation of variates, as Pearson indicated in his memoir of 1904+. 


(5) On the Surface of Constant Association and on “ Natural” Equalisation. 


As we have indicated, Mr Yule never states adequately the conditions under 
which his coefficients of association and colligation are to be applied. He 
apparently considers the nature of the continuity of his frequency surface, if his 
variates are continuous, to be absolutely immaterial. Now in the case of every 
two continuous variates, whatever their nature, a frequency surface does exist for 
which Mr Yule’s association or colligation is constant wherever the divisions may 
be taken upon which the fourfold table is based. Let m,, be all the first quadrant 
frequency corresponding to the total frequencies p and q of the two variates, where 
p and q are supposed to be absolutely known. If the association coefficient be Q 
and the notation 


| 


a b | a+b 


c d= etd Ue 


a+c=p, 
ad+c|b+d|) WN 


we have (1+ Q)/(1 — Q) =ad/be = x, say. 

* In the very same number of Biometrika (Vol. m1. 1904) in which the Huxley Lecture appeared, there 
is a paper on the inheritance of pigmentation in the Greyhound ; it is the work of Pearson’s Laboratory 
and started about the same time as the Huxley Lecture reductions. The following words occur: 
““ When we first started work on the greyhounds, the method of contingency had not been developed, and 
accordingly we made tables for the inheritance of melanism and of red pigment and proceeded to find 
the correlations by the fourfold division process” (I. c. p. 252). And again “ In order to compare the 
fourfold method with contingency methods, 16-fold tables and 25-fold tables were worked out to 
compare with the fourfold tables adopted for the inheritance of red and black pigment respectively ” 
(p. 253). ‘*The results deduced by contingency D method are singularly uniform and steady as 
compared with those of the fourfold-table methods, and we believe, if it be adopted generally for 
such pigmentation problems, it will not only free us from any question of pigmentation scale, but 
afford a good result on a not excessive expenditure of calculating energy” (p. 253). It is clear that 
the Laboratory publicly admitted the difficulties of the fourfold-table method two years before Mr Yule 
started to criticise it as applied to pigmentation data! Yet Mr Yule never mentions this fact. 

+ ‘*On a Generalised Theory of Alternative Inheritance with special reference to Mendel’s Laws,” 
Phil. Trans. Vol. 203, p. 53. 
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Then L=Nygqg, D=G—Nyq, C=P—Nyq, C= N—p— q+ Ngq. 
Therefore — (x — 1) pq — %pq (X— 1) (Pp +9) + N} + pax = 0 


is a quadratic to find n,,. If we take as we always can arrange to do Q positive, 
then x lies between 1 and © or is also positive. The equation for ny, will have 
real roots if 

' (x —-1) (p+ 9) + NP > 4y (x - D pe, 
Ord 


(x —-1? (p-@F + K-11) (op —- 9° + GN -—p- D(x -Dpt+ Dt? > 0, 


which is true since y >1 and 2N>p+gq. Hence, since ypq/(y —1) is always 
positive, the quadratic has always one and only one real positive root. 


eee ek ONyg _ N= Nog (Xe) 
Further : ap eg on Cay 


but y is > xy — 1, ¢> Npq, and p > Np, therefore it follows that both numerator and 
denominator are positive, or 6m,, increases with p. Similarly it increases with gq, 
or in subtracting np, from either np+p,¢ OF Np, ¢4+8q We shall never reach a negative 
difference. Thus it is always possible to construct a surface for which Q is 
constant for every fourfold division. It seems to us that had Mr Yule realised 
the possibility of this surface and studied it*, he would have known more about 
the real properties of his Q and its bearing on such distributions as occur in 
practice. The fact that in every distribution of continuous variates we come 
across there is no approach to constancy in Q, that it varies continuously and 
almost in a predictable manner shows how very far the surface of constant Q is 
from representing the facts of experience. Still had Mr Yule fitted the best 
surface of constant Q to a known distribution of detailed data, and so ascertained 
his value of Q, he would have given us a coefficient which would have lived in 
statistical practice and theory, and he would have thrown real light on the relation- 
ship of association to correlation. We have not spent time in discussing the 
complicated equation to the surface of constant Q, but we have provided one 
illustration of such a surface. Taking the total frequencies of each eye-colour 
group in Father and Son, only adding 5 and 6 together, we have Table I given 
below for the eye-colour distribution categories; this table would within the 
limits of our decimal places have the same coefficient of association, Q = 0°6, 
wherever we divide it into fourfold tables. For example, taking both divisions 
between 2 and 3 we have the fourfold 


191°55 | 143-45 | 335 


166°45 | 498°55 665 Q= "59996. 


358 642 1000 


* We suggest that Mr Yule has not studied the matter, for he writes: ‘‘ There is one case and one only 
where Q is independent of the axes chosen, and that is where the variables are strictly independent,” 
Phil. Trans. Vol. 194, p. 278. For the equation to the surface see Appendix ITI. 
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Or, again, between 6 and 7 vertically and 3 and 4 horizontally 


576-60 | 42:40 | 619 
294-40 | 86-60 | 381 g=-60002 
Soa Ee! =a d 
871 | 129 1900 


which sufficiently illustrate the feasibility of the construction of the surface. 


But at once the fictitious character of Mr Yule’s idea of association comes to 
light. He tells us that it is “natural” to put a table in the equivalent symmetrical 
form—one for example in which we have 50°/, deaths and 50°/, recoveries, 
50 °/, vaccinated and 50 °/. unvaccinated. And this is possible because his Q does 
not change by such a process. But surely if it is “natural” to have equal 
numbers of deaths and recoveries, it is also “natural” to have equal numbers of 
fathers, and for the matter of that equal numbers of sons, in each eye-class. 
Equal light blue eyes, and equal dark blue eyes, equal light brown eyes, equal 
dark browns and blacks. But the instant we study a table of this kind some 


TABLE I. 


Table of Variates in Futher and Son Eye-Classes for Constant Association 
Coefficient 0°6. 


1 is Ee J | 5+6 | 7 | 8 Totals 

1 4:08 | 19°04 | 6:25 260 | 0-78 1:00 0°25 34 

g 19°54 | 14889 | 75:33 | 32°54 | 9-68 | 12:05 2°97 301 

3 7-41 | 90°46 | 90-17 | 52:97 16°86 | 21-03 5°10 284 

yi 2:19 | 284 | 39-43 | 33:19 | 12°58 16°89 4:23 137 

5+6 1°35 | 1712 | 25°63 | 296-40 | 11:82 | 17°83 | 4°85 105 
7 1:03 | 1315 | 19°94 | 23°33 12:18 | 21:60 | 6°77 98 

8 0-40 | 4°80 7-25 8:97 | 515 10°60 | 3°83 41 

| | | 
Totals | 36 322 264 180 | 69 101 | 28 1000 


remarkable points arise. (i) The selection which is advocated by Mr Yule and which 
keeps his coefficient constant is a perfectly arbitrary one, and (ii) it must be stopped 
at a perfectly artificial limit, namely that at which the arbitrary division has been 
made. Mr Yule does not equalise each sub-group, but if, for example, he divides into 
light and dark eyes between 3 and 4, he multiplies his light blue eyes, his dark 
blue eyes and his greys, not by different factors which would equalise these groups 
with each other, but by the same common factor, so that the sub-groups really 
remain in the same proportions as before the equalising. Surely if it is “natural” 
to equalise light and dark eye totals, it is equally “natural” to equalise the totals 
of light blue and dark blue eyes. If it is “natural” to equalise the vaccinated 
and unvaccinated totals, it is surely for the same reason “natural” to equalise the 
several groups of individuals who have one, two, three vaccination scars, for the 
number of their scars “is dependent to a large degree on a purely arbitrary 
circumstance, the activity of the authorities” in making 1, 2, or 3 punctures. 
It is equally “natural” to equalise the numbers who have lived 5, 10, 20, 30 
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or 40 years since their last vaccination, for this “is dependent to a large degree 
on a purely arbitrary circumstance,” the question of whether the population has 
been recently alarmed and revaccinated. The efficiency of vaccination depends 
both on size of cicatrix and on interval since vaccination. Yet if Mr Yule makes 
these equalisations he will wholly change his association coefficient. That coefficient 
admits not of a natural selection, but only of a wholly artificial one, namely, one 
which alters all sub-classes on one side of an arbitrary dividing line in the same 
ratio and not in their “natural” ratios. Every word Mr Yule has uttered with 
regard to equalisation and its “naturalness” applies equally to classifications in 
3, 4 or more groups. Why should the relationship of father to son in eye-colour 
depend upon the number of light blue eyed fathers taken? It is “natural” 
to equalise them with the dark blue. But this is only possible if the multiplying 
of a row or column will not influence the result, and the reader has only to test 
on the above table how Q is changed when he starts such a selection. A few of 
the values we have obtained are Q=°47, ‘44, ‘36, °17 and 04! The fact is that 
Mr Yule’s is not a general selection, but a perfectly artificial one, which is 
not in the least “natural” when we analyse its effect on the constituents of a 
given class. 

It is as well to stop and inquire what influence this “natural” equalising 
has on contingency tables. Below (Table II) is given the table for eye-colour in 
Father and Son reduced to five classes on account of the labour involved. The 
equalising factors are 


For the Rows : For the Columns : 
Yirz=—o' 1aol2, Mie= 61949, 
ys =3'92073, Pees lye, 
ys = 711764, «, =1:00000, 
Yo4e= 147842, 544 = 238871, 
Yr+s = 635625, L748 = 134437, 


and the resulting table is given as Table IIT. 
TABLE IL. 


Actual Hye-Colour Table for Father and Son in Five Classes. 
EKye-Colour of Father *. 


¢ 1+2 3 4 | 5+6 | 748 | Totals | 
N ———— | 
“ SB | 
i 284 | 
S ey | 
< 105 
oO 139 | 
2 pare | 
ar) 


* See Phil. Trans. Vol. 195 A, p. 138, 
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TABLE III. 


Hye-Colour in Father and Son equalised, or put into 


Mr Yule’s “ Natural” Groups. 


142 


Kye-Colour of Father. 


T+8 


Totals 


E 3 4 54+6 

ies 142 1000 
= 3 1000 
s 4 1000 
= 546 1000 
O 7418 1000 
d 

g Totals | 1000 | 1000 5000 


In this case the @ was not constant for all divisions at starting, and therefore 
there is no general standard. But Mr Yule tells us that his method of pseudo- 
ranks is the best known to him for such a table. The Yulean for this table before 
equalisation was 403, after equalisation it is ‘166! @ can change as much as from 
‘52 to 43. It is only an “unnatural” equalisation which keeps Q constant. 


Let us consider another case of this, taken from the one field to which Mr Yule 
in his present paper ventures to apply his coefficient, that of vaccination *. 


TABLE IV. 
Severity of Small Pow. 


Haemorrhagic | Confluent | Abundant] Sparse a ory | Totals 

parse 
Vaccinated. py apy ie 
Years since } 5 ue 932 
vaccination Over 45 155 
Unvaccinated 115 
Totals... | 484 1689 


The equalisation factors are : 


For Rows: For Columns : 
yy, = 61300, 2, =11:0816, 
Yy2= ‘3631, i wl oll, 
Ys= 1496, v= 1:0000, 
Ys= °6923, o,= 1:0113, 
ys = 10697, t= 11659, 


reckoning rows and columns from the left-hand top corner, 


* See Biometrika, Vol. vit. p. 257, 
24—2 
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After equalising the classes of this table we have the following one : 
TABLE V., 
Vaccination Grade and Intensity of Disease. 


Severity of Small Pox. 


| Haemorrhagic | Confluent Abundant | Sparse sues Totals 
2 

| Vaccinated. Oe ef) 

Viera 10.—25 200 
ears since 2545 200 
vaccination | eee 
Over 45 200 
| Unvaccinated ee 200 
Totals ... 1000 


Now here are some of the changes that take place in Q, owing to this 
equalisation of the classes : 


TABLE VI. 

Vertical Division Horizontal Division Old Q | New Q 
Confluent-A bundant .. | Vaccinated-Unvaccinated — 7220 — ‘7070 
Abundant-Sparse ... ... | Waceinated-Unvaccinated —°8599 | — 8945 
Confluent-Abundant .. | 25—45 and over 45... — 5714 | -- "7522 
Confluent-Abundant its 10—25 and 25—45... —°5411 | —°8163 
Abundant-Sparse ... 2.) | ~10-=95 and y2b—-45 sean. — 4469 — 7742 
Haemorrhagic-Confluent... | 10—25 and 25—45... -— ‘5711 | —-7798 
Haemorrhagic-Confluent... | 25—45 and over 45... —'5449 | —*5861 
Haemorrhagic-Confluent.... | | Vaccinated-Unvaccinated — 1009 — ‘1371 

| 


The range of values found for Q for a single table will be considered later ; 
it is sufficient here to indicate how largely the bulk of them are changed by 
equalisation of the groupings. If further evidence be needed for the radical 
changes which invariably accompany Mr Yule’s conception of a “natural” 
grouping, we may note that the Yulean deduced from pseudo-ranks is before 
equalisation — 3099, but after equalisation —°5375. These two equalisations of 
class frequency show that it is impossible to predict a priori how the relationship 
of the two variates will be changed by the process; in the first the relationship 
was lowered by 59 °/, and in the second case raised by 73°/, of its value as 
estimated by the Yulean*! It is clear that only a fictitious type of equalisation 
has been used by Mr Yule. His independence of selection is only an artificial 
one; sub-groups within his categories are not equalised but only equally selected ; 
and further such selection must neither fall short of nor exceed the arbitrary 
division he has selected for his class. If for example we agree to equalise each 


* Consider as a last illustration the Table VII below. The correlation is ‘37. The reader will find 
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age group of the vaceinated—surely a “natural” process—then we shall modity 
the coefficient of association obtained by dividing between vaccinated and un- 
vaccinated. If we equalised the totals of vaccinated and unvaccinated, then Q) 
would have changed, if the division were taken at an interval of 25 years since 


vaccination. The whole process is quite arbitrary, and we believe wholly without 
validity. 


(6) On the Coefficient of Association and the Assumption of Discrete Variates. 


Mr Yule has asserted that when we free ourselves from any necessary relation 
to the theory of normal correlation, the ordinary theory of correlation is applicable 
in its entirety to the 2 x 2-fold table. Apparently he holds that the same is true 
for his coetticients of association and colligation although these different methods 
lead to diverse and often contradictory results, for the simple reason that selection 
vastly affects one and does not affect the other. He has not considered the 
surface of equal association, nor discussed the cases to which the “ four-point” 
surface, involved in using ¢ as Boas and he propose, can be legitimately applied. 
It is accordingly of interest to see what happens when the coefficient of association 
is applied to various types of frequency surface. 


It is no valid reply to criticisms based upon such an investigation to say that 
such frequency surfaces do not occur in practice. Mr Yule has never entered 
into any discussion of the character of the distributions to which he applies his 
association ; it is sufficient for him that they give a fourfold table, and he makes 
no appeal, as we do, to experience as a basis for the adoption of any coefficient. 
It is therefore possible to test his association against the clear idea of correlation 
on any distribution whatever. 


If Mr Yule replies that this is not fair treatment because the coefficient of 
association applies to discrete quantities, we answer that he has never defined 


it an amusing task to equalise the total frequencies; he will then discover that it takes an interesting 
form, that of an old friend of Mr Yule’s, and the correlation will then be recognised as °50. 


TABLE VII. 
First Variate. 


Totals 


a 11 20 30 | 35 40 45 50 40 | 30 2 303 
3 b 20 4400 606 700, 800 | 900 1000 800 | 600 40 9860 
| @ 30 600 9900 1050) 1200 1350 1500 1200 | 900 60 17790 
a d 35 700 1050 13475 1400 | 1575 1750 1400 , 1050 70 22505 
> e 40 800 1200 1400 | 17600 1800 2000 1600 =1200 80 27720 
SS f 45 900 1350 1575 1800 | 22275 2250 | 1800 1350 90 33435 
8 g 50 1000 1500 1750 2000 | 2250 | 27500 | 2000 1500 | 100 39650 
3 h 40 800 1200 1400 1600 | 1800 2000 | 17600 1200 80 27720 
wn i 30 600 900 1050 1200 1350 1500 | 1200 9900 60 17790 

j 2 40 60 70 80 90 100 80 60 44 626 

Totals 


303 9860 | 17790 | 22505 | 27720 | 33435 | 39650 | 27720 | 17790 | 626 197399 
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discrete quantities, and we can only judge what he means by this term from seeing 
the cases to which he has applied it. We shall therefore deal first with the 
discussion of the cases to which Mr Yule has applied his methods, and then 
consider their effects as applied to (a) continuous, (b) discrete variates. 


Mr Yule opens his paper by observing* that if we classify objects into two 
classes only, for example “peas into yellow-seeded and green-seeded, or the 
members of any group of mankind into male and female, the resulting data are 
of the simplest possible form.” The data may be thus in the simplest possible 
form, but difficulties might occur even in such simple cases as those cited by 
Mr Yule. The classification even into yellow- and green-seeded peas is by no 
means so simple as Mr Yule suggests, and certain types of hermaphrodite in man 
might undoubtedly puzzle even Mr Yule’s powers of discrimination. To under- 
stand really what Mr Yule proposes to classify we must turn to the cases to which 
he has applied his method. They are as far as we can judge from an examination 
of his writings the following: (1) Good and Bad Temper, (2) Presence and Absence 
of the Artistic Faculty, (3) Stature in Man, (4) Tallness in Plants, (5) Mental 
Dulness, (6) Low Nutrition, (7) Defects in Development involving “size, form or 
proportioning of parts,” (8) Abnormal Nerve Signs, involving “abnormal actions, 
movements, and balances,” (9) Mental Derangement involving imbecility and 
idiocy, (10) “ Blindness,” (11) Deaf-mutism, (12) Recovery and Death in the case 
of Smallpox, (13) Vaccination or non-vaccination, (14) Male and Female, (15) Cross 
and Self-Fertilisation, (16) Eye-Colour, (17) Colour of Flower, (18) Prickliness of 
Fruit. Mr Yule has probably used or suggested the use of the coefficient of 
association in other casest. Looking through the above cases we see it is in 
the rarest instances, possibly only in (14) and (15), that Mr Yule has confined 
himself to discrete variates. He has applied his coefficient of association over 
and over again to continuously varying quantities. Temper (1), artistic faculty (2), 
stature (3), tallness (4) are all quantitatively determinable variates, even if 
difficult in some cases to measure. One man has a better or worse temper than 
another; one man has a greater or less degree of artistic faculty; where the 
division between presence and absence of the faculty is put, or what is called good 
or bad temper is largely matter of personal equation. But nobody doubts the 
range of such variates; they are continuous, there is no sudden break. 


Now turn to the next four characters (5)—(8). No one who has studied the 
essential difficulty of defining what is feeble-minded will doubt the continuity of 
mental dulness. It is not a discrete character, but a continuous variate. There 
are certainly all grades of mental defect, and the groups idiot, imbecile, feeble- 
minded, “simple” are quite artificial. If the whole population were graded 
according to intelligence, the frequency curve would be continuous; no one knows 
whether it would be Gaussian or not, or whether it would be “humpy” towards 


* Journal R. S. S., Vol. uxxv. p. 579. 
+ He has apparently approved of its application, as we shall see later, to a long series of absolutely 
continuous variates by Professor Niceforo. 
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the tails. If there is a gap between the defective and the normal members of the 
same sibship, intermediates in plenty will be found elsewhere. The whole difficulty 
associated with the Government Bill for the care of the feeble-minded turns on 
the questions how and where to draw the line between the “normal” and “ feeble- 
minded.” Whatever may be finally done, it is quite certain that there will be 
no real distinction between two individuals who fall just one side and just the 
other of the dividing line. Even the personal equation of trained observers will vary 
in classification, and Mr Yule takes the untrained record of thousands of the laymen 
who make census returns as marking off in some manner a “ discrete” character of 
“mental defect.” Absolutely the same remark applies to nutrition. The boundary 
of “low nutrition” is also an arbitrary dichotomy in a continuous variate fixed 
by the personal equation of the observer. (7) tells its own tale, for it is based on 
continuously changing and measurable characters. (8) is less obvious, but not 
only in number, but in quantitative intensity “nerve signs” are really continuous. 
To these also we may add mental derangement (9); there is a very great range 
of variation in imbecility and idiocy as we have already indicated under mental 
dulness. If we turn to “ blindness” (10) the source of it may be most varied, but 
if we define it merely as the loss of the faculty of sight, there is, as the simple cases 
of either congenital or senile cataract might have shown Mr Yule, almost every 
intensity of the failure of sight. Even certain cases of albinism are to be found 
in the Blind Asylums and are educated as semi-blind. Semi-blindness is so well 
recognised that special schools have been started for the semi-blind by certain 
educational authorities. In Bristol out of 22 children sent to the Blind Asylum 
at the expense of the Education Committee 13 had some degree of vision, and 
7 could read large print with varying degrees of difficulty, and were able to do 
some form of handwork by sight. Out of 75 children specially examined for eye 
defects at Bristol in 1911, beyond those requiring glasses were 9 suitable for the 
partially blind class*. Of (11) deaf-mutism, we are less competent to speak, but 
we have been informed by the very best authorities it is far from an absolutely 
fixed condition and that it has a great variety of grades. The grades are more 
marked in the acquired than the congenital cases, but in the latter cases they 
vary with the cause of the congenital deafness. Cases even exist with slight 
degrees of deafness which would probably have been associated with mutism had 
the deafness been more considerable. Different degrees of hearing are found among 
deaf-mutes. There are scores among them who undoubtedly possess an amount of 
vowel hearing, and it helps the tone of their voices when they are being taught 
articulation; this is true, although they cannot distinguish speech without watching 
the lips and without their ears being within two or three feet of their interlocutor. 


* We have had before us the diary of a man who ‘‘ went blind” in old age. At 70 the writing 
is perfectly clear and legible. It closes at 80 with the words scrawled in an almost unreadable 
trembling hand across the page. ‘I am now almost blind and with great difficulty I write this. 
Oh! the misery I feel, no one comprehends it.—-’? When was this man ‘‘ blind” for a census return ? 
The category ‘“‘ blind” covers a vast range of cases of partial sight, and it would be hard to draw a 
dividing line between ‘‘ blind” and ‘‘ normal” in such cases, 
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The number unable to articulate is negligible; all the deaf can make sounds and 
probably 90°/, could articulate with more or less success. Those who acquire 
deafness later in life retain their speech, but with impaired quality. The census 
returns club together the congenital and acquired forms. Where then is the 
“discrete” attribute? As for the blind so for the instruction of the semi-deaf, 
special schools have been established by some educational authorities. There were 
19 scholars reported on in the Bristol Education Committee’s Report for 1911 as 
attending the semi-deaf school at Broad Weir. From “very deaf” to “very slight 
deafness” we have every graduation of hearing from those that can only hear 
under 1 yard to those who can hear at 6 yards. There is every variation in 
speech from the “unintelligible,” “now beginning to use a little,’ “voice very 
weak,” up to “lisping nearly overcome” and “speech of good quality.” Speech- 
reading forms an essential feature of the instruction, and the cases are those 
of transitional deaf-mutism. 


We entirely disagree with Mr Yule’s statements that such attributes as blind 
and seeing, deaf and not deaf, mentally deranged and not deranged, “if not 
absolutely discrete, are very largely distinct from each other” (loc. cit. p. 638), and 
we wholly fail to follow his argument on this point. . 


Mr Yule has apparently seized on recovery and death in cases of small-pox as 
discrete instances, but by Dr Macdonell and one of the present writers they were 
used to measure a continuous quantity—the severity of the attack. The data due 
to Dr Brownlee, and published by one of us in Biometrika, Vol. vil. p. 256, show 
that when the severity of the attack is classed by such categories as haemorrhagic, 
confluent, abundant, sparse, very sparse, that variate is essentially continuous, and 
that the mortality is largely confined to the two highest classes. Again when the 
immunity conferred by vaccination is reclassed under “ unvaccinated,’ vaccinated 
over 45 years ago, 25—45 years back, 10—25 and 0—10, we at once recognise 
that vaccination regarded as conferrmg immunity is an essentially continuous 
variate. The same notion of continuity comes in, if we classify severity by the 
number of pustules on the face, comparison being made with a standard series of 
photographs of typical cases, or again show that area of vaccination scar affects 
the extent of the immunity. Mr Yule can hardly be ignorant of all this work, 
yet he lightly chooses the vaccination data as good illustrations of his method*. 
The classifications of intensity of attack by number of pustules and of vaccination 
by period since vaccination show in general frequency surfaces of a rough “cocked 
hat” type, and dispel at any rate the notion that immunity and severity can be 
treated as discrete variables. It is only the confirmed Mendelian who would 
classify any pigmentation character, whether of eye ‘or of coat or of flower, into 
two alternative groups. The intensity of pigment is undoubtedly a continuous 

* That he was, as long ago as 1899, acquainted with the varying intensity of vaccination is con- 
clusively proved by the following sentence from his memoir on ‘ Association’ (Phil. Trans. Vol. 194 A, 
p. 289) : “The association between non-vaccination and attack is very high indeed for young children 


—8 to °9—but drops sharply to 5 (owing presumably to the waning protection of the vaccination made 
in infancy) in the older age groups.” The italics are ours. 
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variate, when we consider the actual frequency of pigment granules, and it is 
only confusing the issue when eyes are divided into two classes, those with both 
posterior and anterior pigment, and those with only posterior pigment. We shall 
return to the question of eye-colour later, as it is a case out of which Mr Yule 
makes much capital, but which in his last paper is the only one he treats by his own 
association methods. If we take “ prickliness” in fruit, there is no evidence yet 
that it has ever been properly measured, and that it would not prove to be a 
continuous variate, much as “hairiness” proved to be in the case of Lychnis when 
actually measured by Weldon*. To sum up, there are not among the attributes 
used by Mr Yule to illustrate association coefficients any but those of sex and 
nature of fertilisation which can reasonably be considered discrete quantities, and 
in using even these he always couples them with characters which in our opinion 
are distinctly continuous variates. 


We shall therefore start this criticism of Mr Yule’s statistical investigations by 
indicating the fallacious nature of his coefficients of association and colligation as 
applied to continuous variates. We shall then deal with the question of discrete 
alternative variables, and show their absurdities in that case. Finally we shall 
show reason for questioning the details of the bulk of his memoir, which is not 
occupied with the discussion of his special coefficients at all, but in advocating a 
new empirical method which there is ample reason for considering equally fallacious. 


(7) On the Idleness of Mr Yule’s Coefficient of Association when applied 
to Continuous Variates, 


(a) The Need for either Knowledge or Hypothesis as to the Nature of the 
Frequency. 


Let us start from a very general case of two continuous variates. The first 
question that we require to answer is whether for a given value of one variate 
the mean value of the other variate changes. If we can find the mean values 
of the arrays of the second variate for given values of the first we obtain the 
regression line; should this be a straight line the correlation coefficient r is a suit- 
able measure of the relationship, if it be not then the correlation ratio » gives us 
a measure, and ?—7r? marks the deviation of the regression from linearity. In 
cases where the regression lines are both straight and r=0, it by no means 
follows that the two variates are absolutely independent. The next essential 
condition is the equal variability of the arrays, or what is nearly the same thing 
the probability of a combination of one variate « between w and «+ dx with the 
other variate y lying between y and y+ y is the product of the probabilities 

* Biometrika, Vol. 11. pp. 47—55. The danger of this sort of classification has come home very 
emphatically to one of the present writers, who had tried to use the category of ‘‘short-muzzle” 
as against ‘‘long-muzzle”’ in breeding dogs. He was convinced that ‘‘ short-muzzle”’ was a dominant 
character in the Mendelian sense, until he took actually to measuring the muzzles of the hybrids of 


first and later generations, when the idleness of treating such categories as discrete quantities became at 
once obvious. 
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of « lying between w and « + da and of y lying between y and y+ dy. This is the 
true measure of independence of the two variates. It is always possible to test 
the independence of two variates—i.e. the probability of their independence— 
by aid of the mean square contingency and the use of Palin Elderton’s 
Tables*. But there is a whole class of continuous variates for which the 
regressions are linear and the correlation coefficient is zero, but which are yet 
heteroscedastic, i.e. the arrays of the y-variate for a given w-variate are not similar 
frequency distributions. Any frequency surface, with two planes of symmetry, one 
perpendicular to the axis of measurement of each variate, is representative of this 
class. Let DEFG be the oval contour line within which, for a given population Y, 
all the frequency lies. This may be the actual curve in which the frequency surface 
cuts the plane of xy, or if the surface asymptotes to that plane, the contour within 
which all individuals of the N observed actually lie. If the distribution were 
Gaussian, this contour would be an ellipse. Generally let us suppose it any non- 
re-entering oval curvet. 


Let the frequencies in the four classes made by divisions parallel to the axes 
of the variates, 1e. to DOF and HOG, be represented by a Z which we have 
AB| AB 
aB| ap ° 

Then Mr Yule’s coefficient of association Q is (ad — bc)/(ad + bc); it is 
unnecessary at this point to consider Mr Yule’s coefficient of colligation o, 
which is only a special function of Q. 


used throughout in preference to Mr Yule’s more cumbrous 


All along HOG taken as one dividing line (the other will be perpendicular 
to it), Q=0; all along DOF also, Q is zero. All along the arcs HF and DG, 
Q=+1; all along the ares FG and DE, Q@=—1. These values are indicated by 
numbers in the diagram. If the frequency surface be non-re-entering, then when 
the axis of the rectangular dividing planes is taken anywhere in the quadrants 
EOF and DOG, Mr Yule’s Q is positive and varies from 0 to 1; if this axis be 


* Pearson, Drapers’ Research Memoirs, ‘‘On the Theory of Contingency,” p. 6 (Dulau and Co.), and 
Biometrika, Vol. 1. p. 155. 


+ If the curve be a re-entering one, the rapid value changes of Mr Yule’s coefficients are still more 
remarkable! 
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taken in the quadrants HOD and FOG, Q is negative and varies from 0 to — 1. 
Hence for two continuous variates, whose actual correlation is zero but which 
are not homoscedastic, we find Mr Yule’s coefficient will run through the whole 
range of possible values from —1 to +1, and what its observed value will be 
depends solely on where we take our dividing planes. 


It is perfectly true that Mr Yule’s Q vanishes if the variables are theoretically 
independent, but no variables are practically independent, and in actual statistics 
of two continuous variates when the grouping is fairly small we do not get DE FG 
a rectangle, the sole bounding contour for which Mr Yule’s coefficient of association 
is zero all round. ‘The fact that Mr Yule gets +1 or —1 for his coefticient round 
his boundary contour would be of small importance were it not that he appears 
to hold that, when Q= +1, then its probable error is zero. Round the bounding 
contour of a distribution of this kind Pearson’s normal coefficient has usually a big 
probable error, and the investigator is thus warned that its vagaries are of no 
account*. When the investigator comes, however, to a fourfold classification leading 
to Q@=+1 by the zero of one of its classes, he would, if he were to follow Mr Yule, 
assume his result absolutely reliable, and to be so independently of the total 
population used. As a matter of fact, his dividing lines may have given Q=+ 1 
solely because they chanced to be taken near the bounding contour of a frequency 
distribution, of which the investigator knows nothing! The real fact of the case 
is that Mr Yule’s investigation of the probable error of @, while correct as long 
as the frequency of any of his four classes is substantial, fails entirely when one 
of his four classes is zero, and is correspondingly in error when @ is very large 
owing to one of the four classes being very small. Even if Mr Yule determines 
the probable error of Q for such cases by higher approximations, it will be 
meaningless without consideration of the frequency distribution of @, which is 
an exceedingly skew curve for high numerical values of Q. 


Before we leave these cases of zero correlation it is worth while to indicate 


how @ works for various artificial frequency surfaces. 


(1) <A rectangular block. @Q is zero all round the boundary and for all 
possible divisions. 


iG 


(2) A square prism; diagonal planes parallel to the variate axes. @ is positive 
in two quadrants and negative in two, and takes all values between +1 and — 1 
according to the point of division. It is essential to note that Q does not, as 


* The matter is discussed more at length in Appendix I. 
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might be thought, give small values except in immediate proximity to the 
boundaries, it rises to quite substantial values when the percentage in any quad- 
rant takes a value which Mr Yule has not hesitated to use when criticising 
other methods of approaching the problem of association, and—according to 
Mr Yule—as it approaches these values the reliability of his coefficient increases ! 


Diacram II. Frequency surface of zero correlation exhibiting every possible variation of 
Q with different dichotomic lines. 
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(3) Let us now take a cylinder on circular base. Here again the correlation 
coefficient is zero, but Mr Yule’s coefficient runs through its whole range from — 1 
to +1, being negative in two quadrants and positive in the other pair. 


The above cases illustrate, we think effectively, the point that the coefficient of 
association tells us absolutely nothing, unless we make some assumption as to the 
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nature of the surface of frequency. For surfaces of zero correlation it must always 
take all values from +1 to —1 according to the position of the dividing planes. 


We will take one more of these zero correlation tables because it leads up to 
certain new points. Such a table as that given as Table VIII might well occur in 
practice; worked out by the product-moment method, the regression lines are 
linear and there is zero correlation. Table IX shows how extraordinarily the 


Diacram III. Frequency surface of zero correlation exhibiting every possible variation 
of Q with different dichotomic lines. 
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coefficient of association varies from point to point of division. It is not only zero 
along the axes, but zero along a contour line in each quadrant. Thus, starting from 
the centre of the surface, we descend in the first and third quadrants to a negative 
association —°45, then crossing the zero contour, we rise to a positive value of 
+50 and ultimately reach +87! In the second and fourth quadrants the process 
is reversed; we rise first to + ‘45, then sink to zero and descend first to —°50 and 


08 | 917! P48 | LAPT lta | reLz LOGE err | sere | 9669 6906 GLEZL OZOGT/90LLE/0ZOG6T|ELEZT] 6906 | 9669 | 88FE | ELFF | LOGE 7612 | 112 | L261 | FL | 91F \08 STeqOL, | 

OO eS ee eG IL | 8. I ; 

osg¢ JZ 1% |z CP NG Aa zg | 89 z ps 

ogst |@ | % |¢€ ee |te 169 FOL | BGG j Z 
: ogce Jz |e |¢ 89 |ZOT | eel Ice | 68€ € 5 
S recy |e |r |9 OIL | 6FL | IGT LVE | ZEs v GE Mle’ 
= 9819 Je | 2 I9L | 661 | Faz (OS? | €89 ye cents 
Ss prs |p | ¢ EFS | LES | OSE 1889 | 066 g ae 
‘Ss OS601 |F | OL 18@ | SE | Tbr TLL | 69TT | 09€% 91 Ayal eRe 
S O6EST JF | ¢ lor | 68F | O19 LOOT | FZ9L | FTE cr A Nes 
% GFIES |G 909 | 1g | 6&6 SZ9T E8FS | 996F 6L fe |e 
= OLOGe | 0z 6F9 | Z6L | 8001 LILI | 919% | 8TBS 88 ry labs 
4 | spec tc | 62 909 | 1gL | 6&6 8291 | E8FS | 996F | | 191 | 62 Goriliet 
S O6ecL |F | eF LOF | 68h | O19 LOOL FEIT | GhZE | FZ | lek 
om O860L |r | 9T L8G | Eee | lbh TLL | 691 | 09€%@ | 6 | 
~ rere [Pr | ¢ EFS | LOS | OE 88¢ 066 | O6LI 2 | Ge eo | We 
~ 9819 |e | 2 66L | Fa |OSh | €89 | FSET | | }9T | pod Bik 
& Pech |¢ |r 6rL | I6T | LbE 9COL |Z Bl Cian oo al 
< Ogte Jz | € (ZOL | €&T ISS 064 | A eg Chat cs all Se ae 
N osst Jz |z 69 “FOL 90¢ | zz | | | i 
2 os¢ Jz |G 61 eq O€T | al 1S | 2 
S ool JT | 1 ih 2 0g | 

STeq0 J, 


‘Meg JO Ioquioyy ISA 


TWA ATAVE 


198 


199 


Kart Pearson AND Davip Heron 


| 686: — 


998-+|7e9-+| Ogg. +] COF--+ | GFE +/ STE: +] L68-+| E8G- +] BSS. +| LES-+) STS +; — | O | — | BIG-—] LEG. —| BEG. — L6G: —| BIE: —| SPs — 
TTS. +| 20g: +| 062: +| TET- +] 2Z0-+| SFO- —|F80-—| IOT-—|9TI-- L60--| — |—]| 0 160: +| 9TT-+| TOT: +] #80. +] €FO- +] BZ0- - 
989. + | S9T- + | F80- — | 996. — | 098- — | TTF. — | BFF. — | 8OF- — | 6LE- ; 7 | lp — |6Lg-+| 80% +| hF- +] ITP +] 09e-+ 
CLP. + | 9F0-—| LGG- — | ¢88. — | SFF- — | LGF- — | GOP. — | FRE: — (Oe eee 2 PrE-+| GOP +| LOF-+| SFP. + 
88E-+/ SST — | 9G8-— | LUP- —| SSF — | 968-— | SFE--| — | 0 | GFE +| 96E-+]| ESP-4 
00. +| 0%: — | 6FE- — | 6LE- — | €98-— | 008-— |  — | 0 00g: +| e9g-+ 
€9G- + | 666: — | O9E- — | FFE- — | 88- — | | 0 = | 886: + 
SET. +| 866: — | Sk. — | F9B- — | | z 0 i 
| 0 | | 
0 0 0 0 0 0 0 0 oO 0 0 0 0 0 0 0 0 0 0 0 
| 
SET. — | 86Z- + | GZ. + | F9S- + | 0 
€9%- — | 666- + | 09E- +) FFE. + | 88B- 4 (slp a = = = = = | kae= 
008: — | G0%- + | 6FE- + | BLE. + | S9E- + | 008: = = fh ve = — | 00¢-—| g9¢: 
88E- — |SET- + | 9G8- + | LEF- + | ESF. + | 968- + | BFE. + | 0 — | 3Ppe-—| 968-—] ESF - 
GLF- — | 9F0- +| LGS-+| GQe- + | SFF- +) LUF-+| GOF- + | FFE. + | 0 | PPS: —| SOF-— | LP — | SFP — 
929. — | E9T- — | F80-+| 966+ | 098+ +| ITF. + | GFF. + | 8OF- +] 6LE-+) — = SP — 1|6L&-—|80F- —| SFR —| ITF —| 098: - 
TI8-— | 20g: — | 06z- — | Tet. — | 320- — | F0- +] F80-+| TOT-+|911-+/L460-+} — |—] 0 |—| — |L60--|9TT-—| TOT-—| ¥80-—| SF0-—| ZO. + 
998- - | F29- — | ogg. —| GOF- — | EFS-— | SIE: — | L6Z- — | E8B-— | GGS-— | LE%--|S1S-—-| — | O | — | BIG +] L8G-4-| GES. +] S8B- +] L6Z- +} ZTE-+]| SFE-+ 
: ees soca : 
Zz fh x n a ip | a $ r) Db a uw wi) 2 Y ih 2 uy 6 ! a Y 


‘S6L ‘@ uo umoys TITA 2190, 10f uoynwossy fo squavifaog sainx 


‘XI ATAVE 


py fo sanjoy 


200 On Theories of Association 


then to —‘87. What information can the coefficient of association give us as to 
the nature of these two uncorrelated variates? It is no doubt a measure in some 
manner of the heteroscedasticity of the arrays. But how and in what way does 
it measure this phase of want of independence by a value which varies from —1 
to +1 several times over? If it in some manner measures this heteroscedasticity, 
it is only by its local values, it measures nothing of the dependence of the variates 
asa whole. Will Mr Yule tell us how to infer whether, when Q=+°87 or —°87, 
it is a measure of the relation between the means of the arrays corresponding 
to given variates, or is merely a measure of the differences in the variabilities of 
those arrays? Will he also tell us in what manner, by a multiplicity of values, 
it measures mere heteroscedasticity ? 

Are we doing Mr Yule an injustice in taking any notice of Q at the extreme 
limits of our table, e.g. of such values as +°81 or —‘81? Well, consider the 
corresponding fourfold table: 


3 707 710 


77 | 174,295 | 174,372 Q=+°811 


80 175,002 175,082 


and multiply each entry by the factor 


32,527,843 __ 
we find 


557 | 131,351 | — 131,908 
14,306 | 32,381,629 | 32,395,935 


14,863 | 32,512,980 | 32,527,843 


Compare this with the fourfold table deduced by Mr Yule from the Census 


data of 1901: 
Blindness. 


= Present Absent Totals 

& 

o 

Q Present 132,096 132,654 ers: 

= | Absent 32,370,430 32,395,189 Q= + °693, 
~ 

B Totals 32,502,526 32,527,843 

= ae wae a a 


or the following one from the same Census, again formed by Mr Yule: 


Deaf-Mutism. 


P Present | Absent Totals 

ww 

wi 

@ | Present... 96 | 25,221 25,317 Q= 4-789 
3 | Absent... 15,150 32,487,376 32,502,526 

AQ 


Totals 15,246 | 32,512,597 32,527,843 
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and it will be obvious that we are doing Mr Yule no injustice at all. Our 3 in 
175,082 is 557 in 32,527,843 and is far higher than Mr Yule’s 96 in 32,527,843 ! 
Why his 96 in over 32 millions might almost consist of the persons who had 
been rendered at the same instant both deaf and blind by accident! Yet on the 
basis of this result Mr Yule has asserted a “very high association” between deaf- 
mutism and blindness! 


But, since Mr Yule has no hypothesis as to the nature of his frequency, why 
should not the relation between blindness and deaf-mutism be precisely like that 
of the nature of the variates exhibited in Table VIII? If this be so, what has 
Mr Yule’s coefficient of association told us? The variates would be actually 
uncorrelated, but we should anticipate : 


(i) That extremely bad sight would be associated with extreme deafness— 
this is the above Q=+°78. 


(1) Extremely bad sight would be associated with great aural acuity. This 
has often been asserted of the blind. Here Q would be high and negative. 


(ii) Extremely good sight would be associated with extremely good hearing, 
ie. Q would be high and positive. Persons with exceptionally good capacity of one 
sense very frequently have it of another sense. 


(iv) Extremely bad hearing would be associated with exceptionally good 
vision, Le. Q would be high and negative again. This is quite possible, although 
we have no conclusive evidence on the point. In a small school for deaf-mute 
girls 90°/, of the children were found to have normal vision (¢ or §), none 
had worse than ;%. In a group of children of normal hearing with light hair 
only 69°/, had normal vision and 11°5°/, with vision of £; or worse. London 
girls give 85°/,, Glasgow 82°/, and Edinburgh 80°/, normal vision—all lower 
values than in the case of our small sample of deaf mutes. 


Results (1)—(iv) would hold if there were no correlation between goodness 
of sight and hearing—the average sight of a very deaf person being the same 
as one of normal hearing—provided the variability in sight of the very deaf were 
less than that of the general population, and the variability in hearing of the very 
blind were also less than that of the general population *. 


Thus given a fourfold table which is based upon continuous variation, if we 
make no hypothesis with regard to the nature of the frequency, we have in fact 
no idea at all of what Mr Yule’s coefficients of association and colligation really 
measure. They measure in some form or another deviation from independence, 
it may be true correlation or it may be heteroscedasticity, and divisions taken at 
very slight distances apart may give hopelessly divergent values of Q, of which 
difference of values Mr Yule has given no intelligible interpretation. 


* Tn the case of our Table VIII, the variability of the horizontal character for the whole population 
=1:6686, and for the vertical character 1:5979. The variability of the combined two top arrays is 
16444, and for the extreme vertical column on the right 1°8166. Had the variabilities of the two sets 
of arrays been the same as those of the general population, the association would have vanished. 
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If we turn from surfaces of zero correlation to those of finite correlation, we 
find in the same way that Q takes innumerable values which have no mutual 
relationship. Heron has already demonstrated this as far as the Gaussian surface 
of frequency is concerned*. For example, in a Gaussian distribution Mr Yule’s 
association can take every value from ‘37 to 1:0 if the correlation be truly ‘3 
and the divisions be taken along the diagonal. These give for a Gaussian 
distribution the complete range of Q, but it by no means follows that this is 
true for other types of frequency. Mr Yule, however, makes no hypothesis as to 
the nature of his frequency surface. To test the kind of meaning Q conveys, 
suppose the frequency surface to be a rectangular block, length 2a, breadth 2b, 
height h—that is to say, within a given rectangular area the frequency of all 
combinations of the two variates is equally probable. If the block slopes with its 
side 2a at an angle of @ to the w-variate, we have the correlation 


(a? — b*) cos 6 sin 6 : 
Va? cos? 6 + 6? sin? 6 Va? sin? 0 + b? cos? 0 


Y 


The regression lines are built up of straight lines; for a considerable distance 
they coincide with the axes of symmetry, but are afterwards bent round horizontally 
or vertically as the case may be. At the four corners Q=0 and along one pair 
of parallel sides Q=+1, along the other Q=—1. Thus two contour lines of 
zero association pass through the corners pair and pair. In the accompanying 
diagram (p. 203) the values of Q are given for the special case where 0=45°, a=1°5b, 
and accordingly 7 = "3846. Going along the longer axis Q changes from — 1:0 to 
— ‘295 and so to zero, then it rises to +°6 at the centre and falls to zero again, 
becoming negative and ultimately concluding with —1:0 at the boundary. Along 
the shorter axis Q varies from + 1:0 to +6 at the centre and rises to +1°0 again 
below it. What can be learnt as to the real association of two variates by a 
coefficient which behaves in this way? The case would be quite different if 
Mr Yule had indicated a type of surface for which Q was constant for all 
divisions and demonstrated that it represented, even with moderate approxi- 
mation, such distributions as occur in statistical practice. One such surface of 
“stable association” at any rate is known for the tetrachoric 7 treated as a 
coefficient of association merely, and that surface is not widely divergent from a 
considerable number that we actually meet with. 


(b) The Fallacy of Mr Yule’s Selection in the case of Continuous Variates. 


If @ be not even approximately comparable with itself when taken on the 
same surface with different dichotomies, how can it be comparable as a measure 
of any real relationship from one surface to a second? Mr Yule will no doubt 
reply that a function of @ does measure certain percentages when the table 
is dressed in an equalised symmetrical form. Our reply is that that form has 
been obtained by a method of selection which makes very large changes in every 
other coefficient, including the Boas-Yulean, which has been used to measure 


* Biometrika, Vol. vii. p. 109. 
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relationship; that we dispute entirely the legitimacy of such selection which is of 
a singularly arbitrary character, both in the extent to which it is applied and the 
region to which it is circumscribed. If we apply it to continuous frequency 
surfaces, so that a certain Q remains constant, all other Q’s are changed, and r 
taken as a measure of the relationship of the variates as a whole is often 


Diacram IV. Q for frequency surface a right six-face. To illustrate how variation of Q depends 


on form of distribution and how it has no relation whatever to true correlation. Actual corre- 
lation ‘3846. 
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immensely changed. A correlation table has a multiplicity of Q’s and one product- 
moment 7; a process of selection, which changes all Q’s but one, and widely 
modifies r, has attributed to it by Mr Yule some special merit by which that Q, 
in preference to any other, is considered for the time being to measure the 
“association” of the variates ! 


26—2 
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Any argument, as we have already indicated, which is valid when applied to 
the columns and rows of a fourfold table ought to be valid when applied to the 
columns and rows of a multifold table. Such a table should also not be affected 
by selection. Well, let us take the Table from the Census of the age of husband 
and wife and let us select so as not to change certain Qs and see what effect 
this has on the correlation. The following tables give @ and r before and after 
a series of selections. 


Let the divisions be at the same ages for both husband and wife, say under 30 


and over 30. 
Coefficient of Association ='9745 


Coefficient of Colligation =*7958 


Correlation before Selection =°9136* 


Percentage of those over 30 years selected : 


Actual Correlation Actual Correlation 
100 °/, ‘9136 Sie “6969 
10°/, 8347 4°}, “6479 
Sale “8175 3°), 5921 
Saya “7961 hs ‘5360 
eal “7698 ais "4952 
6°/, ‘7369 Ome "4850 


In other words the selection which reduces the actual correlation from ‘914 
to 485 leaves Mr Yule’s coefficients of association and colligation unchanged! 
No, this is not true; every other coefficient of association and colligation for 
the table is changed, except the particular two for the arbitrary division at 
30 years! What legitimate inference of any kind can be drawn from the constancy 
of this individual pair ? 


Now let us select husbands and wives unequally but still at age 30 divisions: 


Husbands Wives Actual Correlation 
100 °/, 100 °/, 914 
10°/, over 30 10°/, under 30 908 
1°/, over 30 10 °/, under 30 850 
10 °/, under 30 1°/, over 30 “715 
1 °/, under 30 01 °/, over 30 "285 
000 °/, under 30 000 °/, over 30 — 009 
000 °/, over 30 ‘000 °/, under 30 — 038 


During all these operations which reduce the actual relationship as measured . 
by correlation from the very high value ‘914 to zero and even to negative values, 
Mr Yule’s association and colligation for his selected dichotomies show the constant 
“very high values” ‘975 and ‘796. At other dichotomies of course they cover 
pretty well the whole possible scale. 


* Value without Sheppard’s corrections, because in dealing with selection it is not clear that those 
corrections are always appropriate. 


On 
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Nor has the result anything to do with the division at ages 30. If we divide 
at 21 years we find: 


Percentage 
selected over 21 Actual Correlation 
100 °/, 914 
10 °/, ‘939 
ieee 941 
1°), 493 
000 °/, 272 


Here the Q and » of Mr Yule retain throughout the operations the values 
‘987 and °853, which mark, we have been told, “ very high association” ! 


Nor is the absurdity in the least confined to ages of husband and wife. Let 
us take stature in Father and Son* and divide into a fourfold at Fathers 67:5” 
and over, and Sons at 68°5” and over. The actual correlation is 520, Q=°683 
and » ='395, 


Selection of Father Selection of Son Actual Correlation 
100 °/, 100 °/, 520 

10°/, over  67°5" 10 °/, over 68:5” 314 

10°/, under 67:5” 10 °/, over 68°75” 275 
000 °/, over 67:5” 000 °/, over 68:5” 251 

1°/, under 67°5” 1°/, over 68°5” 185 


The result is exactly the same as before, the real relationship is immensely 
modified by selection, while the colligation and association remain unchanged for 
one pair of arbitrary divisions and for this one only. What can be learnt from 
such a statistical method? We venture to believe that from the standpoint of 
common sense it is wholly without meaning. 


What is the precise physical character which is to be attached to this wide 
difference between “association” and correlation? That correlation is atfected 
by selection we know only too well; it is one of the factors of progressive 
evolution under natural selection, but what profitable meaning of any kind 
is to be attached to the statement that one out of an indefinite number of 
associations has remained unchanged by a special selection? Does not the 
principle that “association” or “colligation” is unchanged by selection arise 
from the fact that Mir Yule has merely guessed a denominator to his coef- 
ficient, which denominator has no theoretical justification of any kind; and his 
principle that selection makes no change is a later discovery and has no validity 
at all, for it is not a “natural” selection and has no generality beyond the fourfold 
table ? 


We think we have sufficiently indicated that Mr Yule’s coefficients of associa- 
tion and colligation fail entirely for any variates which may be suspected in any 
way of continuity, and the bulk of the variates to which Mr Yule has applied his 
methods undoubtedly have such continuity. 


* Biometrika, Vol. u. p. 415. 
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Mr Yule would no doubt. tell us that he has distinctly stated that he distin- 
guishes between correlation and association and that he knows they may lead 
to diverse results. We reply that, wherever there is any real continuity, the 
assumption of a discrete “attribute” disguises its existence and will lead to 
erroneous conclusions. Further he directly states* that: “The methods applicable 
to the former kind of observations, which may be termed STATISTICS OF ATTRI- 
BUTES, are also applicable to the latter STATISTICS OF VARIABLES. A record 
of statures of men for example may be treated by simply counting all measure- 
ments as tall that exceed a certain limit, neglecting the magnitude of excess or 
defect, and stating the numbers of tall and short (or more strictly not-tall) on 
the basis of this classification. Similarly, the methods that are specially adapted 
to the treatment of statistics of variables, making use of each value recorded, are 
available to a greater extent than might at first sight seem possible for dealing 
with statistics of attributes. For example, we may treat the presence or absence 
of the attribute as corresponding to the changes of a variable which can only 
possess two values, say 0 and 1.” 


Here Mr Yule directly claims that his methods can be applied to stature, and 
in the next sentence suggests that it is reasonable to treat the difference between 
any tall man and any short man as unity because they have been placed under two 
class-indices “ tall” and “short”! He started his statistical work from the stand- 
point of the pure logician, and he does not perceive that he is applying his 
reasoning to the class-names of things and not to the things themselves behind 
these names. Let us take head length and head breadth with a correlation, 
say, of 50, and lengths of femur and humerus with a correlation, say, of ‘60, then 
it is perfectly easy by selecting your head length and breadth division and your 
femur length and humerus length divisions, to make the association between head 
length and breadth either greater or less than that between femur length and 
humerus length. What is the value of the coefficient of association as a measure 
of relationship if this be the case? Every new division gives a different ratio 
of association for the two sets of attributes. The application of such methods 
in practice can only tend to the detriment of modern statistical theory. 


(8) On the Application of Mr Yule’s Coefficients to Discrete 
Variates. Mendelism. 


May we not, however, accept Mr Yule’s claims for his coefticients of association 
when the classes differ by a real discrete unit—not a unit arbitrarily introduced 
by calling a “short” man 0 and a “tall” man 1? Well, the difficulty is to find 
such cases. However, supposing them to exist, then there is no question that the 
coefficients of association and colligation are not the right methods of approaching 
the problem, but that the ordinary product-moment correlation coefficient is the 


* Theory of Statistics, pp. 7—8. 
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correct method; but this will lead us to results absolutely opposed to those of 
Mr Yule’s association and colligation. 


We ourselves, however, doubt the existence of this discrete unit; we have 
only come across it in theoretical Mendelian investigations and doubt whether 
the “unit character” which is absent or present has any existence in somatic 
classifications. Of this we should like to give two or three illustrations. 


The first illustration we take is from a paper* by Professor E, M. Kast, 
entitled “The Mendelian Notation as a Description of Physiological Facts.” 
Professor East is a vigorous Mendelian making the very best defence he can 
of the Mendelian notation in the present parlous condition of Mendelian theory— 
which assumes the truth of the unity of the unit. The cross which Professor 
East cites is one between a “long” corolla and a “short” corolla race of Nicotiana. 
Speaking of other Mendelian investigations into size, Professor East writes of 
them: “No criticism could be made except that certain of the characters used 
varied considerably in the mother varieties and therefore were presumably not 
homozygous for all character factorst. This criticism is apparently answered by 
a recent investigation of the writer’s, as yet unpublished, where two species, 
Nicotiana forgetiana and Nicotiana alata grandiflora, were crossed. As seen by 
the table, the corolla length is very slightly variable in either species{, nor is it 
affected to any extent by environment, yet each species was absolutely reproduced 
by recombination in the /, generation.” 


TABLE X. 


Frequency Distributions for Length of Corolla in a Cross between Nicotiana 
forgetiana (814) and N. alata grandiflora (321). 


Class Centres in Millimetres 
Designation | Totals 


20 | 25 | 80| 35 | 40 | 45 | 50 | 55 | 60 | 65 | 70| 75 | 80 | 85 | 90 | 
314 OM4\ 133, |-28 = ee —| 170 
321 = —j} 1] 19] 50| 56|-32! 9| 167 
Gisele | = | =.) 3.) 30! 58] 20:| — | | Li 
(314x321) F, | — | 5| 27 | 79 | 136 ee ee te 64/30/15] 6| 2|—| 828 
| | | 


Now let us call short all “below 60 mm.” and long all “above 60 mm.” We 
have all the offspring “below 60 mm.” Hence there is “dominance” 
corolla, and we may apply the magic formulae: 

(DD) x (RR) = 4 (DR), 
(DR) x (DR) = (DD) +2(DR) +(RR), 

* The American Naturalist, Vol. xvi. p. 639. 

+ Note the writer’s interpretation of the results by the preconceived theory ! 

+ The author does not tell us how many plants were grown in each generation. In the parent 


generation presumably only two. How many were considered in F; and I)? The difference of 
variability would largely turn on this factor. 


of short 
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with the result that the segregating generation F, shows 222 (RR)’s, a very 
plausible Mendelian quarter. 


Prof. East does not do this, although it was a step absolutely compatible 
with current impressionist classifications by Mendelians. All he says is that 
“each species was absolutely reproduced by recombination in the F, genera- 
tion” and “I do maintain that the Mendelian notation satisfies the facts of 
size inheritance as well as it satisfies the facts of qualitative inheritance” 


(lc. p. 689). 


Well, the fourfold Mendelian table of somatic characters would give in the 
F, generation (RR) = 222, (DD) + (DR) =606*. Now will anything be discovered 
by assuming that those two groups differ by a unit? This is the “dichotomy” of 
Mr Yule’s association. We contend that, while theoretically (RR) differs from 
(DR) by having no D as against one D and from (DD) by the latter’s having 
two D's, this theory is idle when pushed into actual Mendelian statisticst. The 
division at 60 is not a dichotomy of things differing by a unit, except in name, 
but an arbitrary cut across a continuous distribution, and the application of either 
the Boas-Yulean ¢@ or such coefficients as those of association and colligation is 
entirely misleading. We are told that “short” is “dominant” over “tall” as 
a result of experiment, and the importance of dividing at 60 mm. to get our 
tables is dropped out of sight. 


Piebaldism is another unit character of the Mendelians, and doubtless 
Mr Yule would be content to take his dichotomy between piebald and whole 
colour. Now here are 2314 mice classified according to the ratio of pigmented 
area of coat to whole area of coat?. Where shall we make the division between 


Total | 10, *15 | 20, -25 ‘40, 45.| 50, 55 | -60, 65 | -70, °75 | -80, -85 | -90, 96 ‘975 | 1:00 


ee ee 
2314 | 6 | 31 | 60 | 81 | 126 | 11 138 
| i ee | | 


176 79 393 | 1113 


the piebald and the whole colour? The group ‘975 largely refers to individuals 
with a very small white area on belly; but as a matter of fact the 1:00 group 
has been divided into three sub-groups of individuals who, without having whate 
areas on belly, have or have not somewhat lighter pigmented areas there. Where 
is the true dichotomy, especially when we can show that each grade of pie- 
baldism is hereditary? Mr Yule would no doubt apply his association to 
piebalds and whole colours as giving a Mendelian unit, but in doing so he 
will be applying his dichotomy to words, to class-indices and not to the real 
things represented by them. 

* Whether this means anything or not would of course depend on the 105 at 60 never in later 
generations giving anything below 60! 

+ Take the division at 85 mm. and we have dominance of long corolla, but the Mendelian quarter 
now fails. 

+ From the late W. I’. R. Weldon’s mice data now at press. 
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Again, an illustration from Mendelian dichotomy may be found in a paper 
by Hurst entitled “Mendel’s ‘Law’ applied to Orchid Hybrids*.” He desired 
to give a proof that the F, generation consists of 50°/, of (DR)’s and 50°/, 
of “specifics,” (DD)’s and (RR)’s. He recognised that the first cross gave an 
“intermediate,” so he defined his (DD) as all those, which show ? (DD) character 
and more, his (RR) as all those that show 2(RR) character and more, and the 
“intermediates” or apparently the (DR)’s all those that show character between 
i (DD) and ?(RR). As a result his “specifics” came out as 2281 and his 
“Intermediates” as 2267 in number, a plausible Mendelian 1:1 ratio. Thus the 
classification into every one of the groups (DD), (DR) and (RR) in the F, 
generation is by trisection of a continuous variate at arbitrary valuest. 

Pearson has come across an exactly similar instance of the vagueness of the 
Mendelian unit in breeding dogs. If a short-muzzled dog be crossed with the 
long-muzzled dog, the hybrid would be described by general impression, and was 
so considered by him, as short-muzzled. The result was to indicate dominance 
of the short-muzzle. But when muzzle indices were formed and the dogs’ heads 
measured in a variety of ways, the hybrids were found to be intermediates, and, 
crossed in again with the short-muzzled stock, they gave a group the mean of 
which had a position intermediate between the hybrid and that original stock. 
Each generation had very considerable variation. Dichotomy giving Mendelian 
ratios was possible, provided an arbitrary division was taken across the con- 
tinuous distribution. Mr Yule’s unit difference, short-muzzle—not short-muzzle, 
would be a perfectly idle one across what in the F, generation is a continuous 
distribution. 

One of the most remarkable features, indeed, of the present situation is the 
assumption that in some mysterious manner Mr Yule’s coefficients of association 
or colligation can be applied to Mendelian results. Mr Sanger, in his contribution 
to the discussion, said: 

“One additional reason why he weleomed the Paper was that the rise of 
Mendelian biology had made a great difference. There they were always dealing 
with things which were discrete, whereas according to all Galtonian laws they 
always dealt with things which were thought to be continuous. At present there 
was this difficulty that mathematicians had a prejudice in favour of more elegant 
mathematics, and the Mendelians had not yet learnt algebra; but that day would 
come, and then Mr Yule’s work would be the work for the Mendelians” (J..R.S.S. 
Vol. LXXv. p. 646). 

Mr Yule nowhere repudiated this application of his coefficients, and yet they 
are the last which can possibly be applied to Mendelian data! We put on one 

* Journal of the Royal Horticultural Society, Vol. xxv1. Part 4. 

+ Martin Leake has found continuity in the F, generation with an intermediate mean in the case of 
Indian cottons, Journal Asiatic Soc. of Bengal, N.S. Vol. 1v. p. 13, 1908. Even more astonishing 
frequency distributions for the Fy generation for ‘‘talls” and ‘‘ shorts,’ number of nodes and lengths 
of internodes may be obtained from Mr R. H. Lock’s ‘“‘Stndies in Plant Breeding in the Tropics,” 
Annals of Royal Botanic Gardens, Peradeniya, Vol. 11. In these cases it is wholly impossible to speak 


of a unit difference between the members of either class. 
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side what Mr Sanger can possibly mean by “all Galtonian laws” dealing with 
things thought to be continuous—he has clearly never read the treatment of eye- 
colour in man nor of coat-colour in Bassett hounds by Sir Francis himself, who 
distinctly treated them as discrete quantities and applies “his laws” to them. But 
we must indicate the fallacy of applying coefficients of association and colligation 
to Mendelian characters. The reason for applying them is the assumption made 
that a Mendelian character is a discrete unit. But if this be so, fourfold, and 
three by three Mendelian tables should be treated as discrete tables and true 
product moments formed of them. We believe that one of us was absolutely the 
first to apply these methods, treating the Mendelian theoretical characters as 
units*, and his work has been followed up by a whole series of workers in the 
Biometric Laboratory}. There was thus no question with the Biometric School 
of how Mendelian theoretical problems should be dealt with, and Mr Yule wholly 
misses the point when he states that Dr Snow’s “recent comments in Biometrika 
on the use of the normal coefficient for Mendelian tables in Dr Brownlee’s paper” 
were “a much stronger condemnation of Professor Pearson’s than Dr Brownlee’s 
work” (J. B.S. S. Vol. Lxxv. p. 651). Pearson has never used a normal corre- 
lation coefficient on a true fourfold tablet, which he believed to be Mendelian in 
character. He has only applied such coefficients when he believed the character 
under consideration to be at bottom continuous, and as far as eye-colour is con- 
cerned, the many dissections of eyes he has been able to examine in his recent 
investigations as to albinism have confirmed rather than weakened that stand- 
point. But even had he done so, although the use of such a normal coefficient 
might be criticised on the ground of the labour involved in determining it, it cannot 
be condemned on any other ground, for it is in all respects as good a coefficient of 
association as Mr Yule’s Q or , and possesses the important property that it is 
subject to selection—in opposition to the wholly fictitious merit which Mr Yule 
claims for his coefficients, namely that they are uninfluenced by selection. 

This point is so well illustrated by Mendelian theoretical tables, that we stay 
to demonstrate it here. Let us consider the correlation of father and offspring 
when a population represented symbolically by the fathers 

L(AA)+m(Aa) + (aa) 
is crossed at random with a population of mothers given by 
l’(AA) + m' (Aa) +’ (aa). 

Here we can put V=l+m+n= total of fathers, l’+m'+n'= WN’ = total 
mothers, and the fundamental Mendelian formulae 

(AA) x (aa) = 4(Aa), 
(AA) x (Aa) = 2(AA)+2 (Aa), 
(Aa) x (Aa) = (AA) +2 (Aa) + (aa) 
* Phil. Trans. Vol. 203 A, pp. 53—86, and R. S. Proc. Vol. 81 B, p. 225. 
+ Jacobs, R. S. Proc. Vol. 84 B, p 23; Snow, R. S. Proc. Vol. 83 B, p. 37, and Biometrika, 
Vol. vir. p. 420. 


+ The error of Dr Brownlee’s work was that he went back on all this and applied continuous 
methods to theoretical Mendelian tables; see Proc, Roy. Soc, Edin. Vol. xxx. p. 473. 
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are assumed to hold. If every possible father be mated with every possible 
mother so as to insure random mating, we have the contingency table for father 
and offspring : 


Father. 
Totals 
(AA) 2 (2U' +m’) m (20 +m’) 0) (24m) (2U +m’) 
4 1, 2 Am’ (1 - 
& 21 (im’ + 2n') | 2m (U +m’ +n’) 2n (21 +m’) a aa an is He) 
iS 6 a ae : 
m (m’ +27’) Qn (m' + 22’) (m +2n) (m' +2n') 


| 4m (U'+m' +2’) 4n (U+m'+n’) 4(d+m+n) x (+m +7) 
| 


4 (U + m' +n’) 


Now there are four points at which we can make a fourfold classification, a, 8, 
y and 6, and these will give tables for which the association and colligation in 
Mr Yule’s sense can be calculated. Now we will suppose 


N=l+mt+n and N’=l'4+ m+n’ 


to remain constant, so that the total population is the same. Then if we divide 
at a no selection of (AA) or “dominant” fathers will affect the coefficient of asso- 
ciation ; if we divide at 8 or 6, Mr Yule’s coefficients are unity, or there is perfect 
relationship between parents and offspring, and if we divide at y, no selection of 
recessives will affect the Yulean association. What light can Mr Yule’s coefficients 
possibly throw on Mendelian inheritance, when for two possible divisions they 
make the parental relationship perfect and for the other two they give substantial 
values of the relationship, but render it completely independent of selections, 
which in reality widely influence the relationship of parent and offspring, if we 
proceed by the theory of discrete units? If we accept—which the present writers 
do not—the theory of dominance and assert that (AA) and (Aa) are somati- 
cally identical, and represent one somatic character, then y is the only reasonable 
division to make, and the typical Mendelian table becomes: 


Father. 


(AA), (Aa) (aa) Totals 


2 (AA), (Aa) | 4(l4+m) NV’ —m (m' 427’) | 

B 

a 

5 (aa) m (nm +2n’) an (m' + 2n') 
Totals 4(¢+m) N’ 4nN’ 


27—2 
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We have at once from this table Mr Yule’s coefficient of association 
=== NTA +m) 
© N’ (21+ m) + m (QU +m’) 


But the correlation r, as found by the product-moment method for discrete unit 
characters, 1s 


———— Vin Vm! + Qn’ (21 + m) pee 
Vin 2) (Lm) [NN = (on + Bn) (nl + BW’) 


Now examine Mr Yule’s coefficient : 


(i) There is perfect association, Q=1, when we take no heteromorphic 
fathers (m= 0). 


L(m' + 2n’) 
Th c ] C = Ni 
evalu of 2UN' +n (2l' + m’)’ 


and takes, as it should do, all sorts of values according to the nature of the 
mothers, and the proportions of true dominant and true recessive fathers in 
the whole paternity. Thus, for all fathers recessive, it is zero, and for all fathers 
true dominant (n= 0) with all mothers purely recessive (J’ =m’ =0) it is unity. 
Thus the true correlation under the influence of selection can take every possible 
value while Mr Yule’s coefficient gives perfect association throughout ! 


(ii) There is perfect association, @ = 1, when we take only recessive mothers, 
Len — 1 0, 


rm n(2l+m) 
The value of r = se ii an) n ma)? 


and depends entirely on the distribution of fathers. When there are no hetero- 
morphic fathers (m= 0), it is unity. When there are no recessive fathers (n=0), 
it is zero. That is to say, while Mr Yule’s coefficient shows perfect relationship 
throughout, the true correlation or real association can run through the whole 
range from zero to unity. 


It is not too much to say that those who suggest that Mr Yule’s coefficients 
of association and colligation will be of service in Mendelian problems cannot have 
had any acquaintance with the nature of those problems at all. Mendelian theory 
relates to discrete units and every coefficient which is uninfluenced by selection is 
on that very ground wholly unsuited for use with such units. Selection modifies 
correlation when we deal with discrete units just as much as when we deal with 
continuous characters, and any coefficient is valueless which directly starts with 
the property that it will not be modified by selection. 


Mendelian practice classifies under unit designations individuals which, as we 
have just indicated, often show no sharp line of division at all. In such cases to 
treat the difference of two classes as a unit is juggling with class names, not 
dealing with the things so classed. Mr Yule has failed in these matters because 
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he starts from the field of pure logic and not from the observation and record of 
actualities. Even if the actual Mendelian differences were units, not the differences 
of continuous variation, then ¢ would be the right coefficient to use, not those 
of colligation or association*. But even here the results will be often difficult to 
interpret. In the usual case, however, of Mendelian practice, what we need is 
not the value of a correlation, but an investigation of whether observation is a 
reasonable fit to theory, i.e. we must use the ordinary “Goodness of Fit” test. 
This point is discussed in Appendix II, as there has recently been some mis- 
interpretation of the matter. 


* We have taken the series of those symmetrical fourfold tables for which ¢ has always the Mendelian 
value 1/3; the values of Q range from ‘6 to 1. What interpretation can association give of such 
Mendelian tables? 

+ The evil done by Mr Yule’s preaching of association to the neglect of more general methods is 
manifest in a recent paper by G. N. Collins in The American Naturalist, Vol. xuv1. p.572. He gives 
such numbers as the following for flower colour and long pollen in hybrid sweet peas, taken from 
Bateson, Saunders and Punnett, Report III to the Evolution Committee, p. 9, 1906. The calculated 


PURPLE Rep WHITE 
=a cn a = 
Long | Round | Long | Round | Long | Round 
| | 
| | > | aa) | 
Observed 1528 106 117 381 | alates) 394 
Calculated 1448-5 | 122-7 | 122-7 at | 4015 | 12205 | 407-4 


numbers are curious; the authors do not explain adequately how they have obtained them. Assuming 
them to be correct—but of this we have doubts—the problem proposed by Mr Collins is to determine 
whether the observed ‘‘ Purple” and ‘‘Red” as distributed into ‘‘ Long” and “Round” are a random 
sample from the calculated values, i.e. we compare 


Observation ca |) eilo28: 106 healt | 381 | 
Mheoryee es 4 |) 14455) || 1297 | 129-7 |) 401s 


Mr Collins remarks ‘‘No method has been proposed for making definite comparisons between such 
series of numbers” (p. 572), and continues ‘‘ A customary and direct method of comparing the degree 
of relationship that exists between any two characters is to compute the coefficient of correlation or 
Yule’s ‘coefficient of association.’’’ In the discussion which follows Mr Yule’s ‘coefficient of asso- 
ciation”? (1900) is used. Considering the work of Pearson and Elderton on ‘‘ Criteria of Goodness 
of Fit” (Phil. Mag. Vol. u, 1900, pp. 157—175, and Biometrika, Vol. 1. pp. 155—163), Mr Collins can 
hardly have gone far in statistics, for how would he have proceeded had he included the ‘‘ White” 
in his series? The proper method appears to us the general one, i.e. to determine the probability 
P of the recorded divergence between observation and theory, calculating 


(observation — theory)? 


2=sum 
x theory 


’ 


and deducing P by Elderton’s Tables: see our Appendix II. In this case a deviation as large as that 
observed would only occur once in twenty-one trials or the odds are 20 to 1. But we believe Messrs 
Bateson and Punnett have done themselves injustice. We do not write this in disparagement of 
Mr Collins’ work; he is undoubtedly right in demanding some test for ‘‘ goodness of fit” in these 
luxuriant Mendelian formulae. 
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(9) On the Limitation in Value of the Boas-Yulean ©. 


Given two total variate frequencies, if we can assert nothing of the nature of 
the distribution, the maximum value of the uncorrected mean square contingency 
coefficient depends on the number of cells and cannot for a finite number of cells 
exceed a certain limit. Mr Yule has spoken of this fact as if it were a serious blot 
on the method of contingency. We do not agree with him, but it is singular that 
if he thinks so, he should not have rejected the use of ¢, the “theoretical value of 
the correlation.” The fact that @ had a maximum limit was known to Mr Yule*, 
yet he never throughout his paper refers to it as detrimental to his own “ theoretical 
value of the correlation.” Consider any table: 


Wy7t 2,2 
V1 4 dale Tae | ah 
NN iV | 
NQM, Ng9M~g 
Vv —2 V +7 | Ny 
wae 
mM My | N 


This is the most general form the fourfold can take for given 7, m., 7m, m. We 


then have ae 
6 = aN /Jnynm, my. 


This will be a maximum of a positive kind when a takes the largest possible 
value, Le. when « is equal to the lesser of n,m./N and n.m,/N. It will be a maximum 


of negative kind when « is equal to the lesser of WE and eae Thus ¢@ always 


lies between definite limits which may be most restricted. 


Consider the table 
2269+) 97261 -—2x 99530 


ll-w# 459+2 470 


2280 97720 100000 
Here the limits are given by e=11 and « =— 459 or we have the two tables 
| | | 
2280 | 97250 | 99530 and 1810 | 97720 | 99530 
0| 470 | 470 470 o | 470 
2280 | 97720 | 100000 2280 | 97720 100000 


These give ¢='0106 and ¢ = — ‘4499. 


* Journal of R. S. Soc. Vol. uxxv. p. 604. 
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Mr Yule’s association coefficient is for the two cases 
Oe andee ia. 


In the first case the tetrachoric 7 is +10 and in the second tetracheric 
7,=—1:0, since h=2°0 and k=2°6*. As another illustration take 


5000 0 5000 


228 | 5000 


4772 


9772 | 228 | 10000 


This table gives the least positive @ for the given marginal frequencies; and 
with these frequencies @ can never rise above the value + °1527 which it has for 
this table. But both Q and tetrachoric 7, = + 1:0. 


The minimum value of ¢ for the same total frequencies is given by the table 


4772 | 228 | 5000 


| 
5000 | O 5000 


9772 | 228 | 10000 


This has @ = — ‘1527, while Q = — 1 and tetrachoric r,=— 1. In other words, 
@ is restricted to lie between +1527 and — 1527, while @ and tetrachoric 7, may 
pass through the whole range +1°0 to—1:0. Why has Mr Yule not pointed out 
these facts when recommending ¢ as obviously suitable for all fourfold tables when 
we get rid of normal variation? Clearly, if he had done so his criticism of 
uncorrected contingency would have been shown to apply with still greater force 
to his own coefficient. The standard deviations of the variates of the fourfold 


a|b : V(a+c)(b+d) (GUC Oy a 
table all are not given by V “A and N A’ and the 


ad — be 
ie 
points at distances A and A’ which Mr Yule takes as units. But it is clear that 
when we have done this we (i) have fixed the standard deviations of the variates, 
(ii) can shift our dichotomic lines throughout the whole ranges of A and A’ with- 
out influencing the result. If the variates are really not concentrated into points 
their standard deviations are wholly independent of the dichotomic lines, and 
every shifting of those lines will change the proportions of each variate falling into 
the two categories. The independence of the standard deviations of the dichotomic 
lines is the advantage of the tetrachoric 7; over ¢; and in practical Mendelian 
statistics 1t is Im most cases impossible to shift the dichotomic lines without 


product moment by AA’, unless we may concentrate each variate into 


* Actually h=1-999087 and k=2-597180. As we have already indicated and shall further emphasise 
in Appendix I, the value of tetrachoric 7, is indeterminable by the usual method in such cases, 
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modifying the frequencies. This is the ground we have had for applying ¢ to 
theoretical but not to practical Mendelism*. 


A further illustration of this limitation of @ for given marginal frequencies of 
the fourfold is provided in the accompanying Diagram V (p. 217). Here the range of 
values possible for the Boas-Yulean is given for the special case where one variate 
has a median division, and the percentage at which the dichotomy of the other 
variate takes place is given on the horizontal line; for example, for a 10°/, dichotomy 
¢ must lie between + ‘3333. We do not ourselves lay stress on this limitation of 
the range of values in the Boas-Yulean, but if it be a defect of the coefficient 
of mean square contingency that for a fourfold table its value cannot exceed 
‘707, it is also a defect of the coefficient recommended by Mr Yule that it also has 
a limited range for given marginal frequencies, a limitation not shared by the 
tetrachoric coefficient or even Mr Yule’s coefficient of association. 


(10) The Coefficient of Contingency. 


We do not propose to take up at great length a defence of this coefficient 
because one of us has had for some years a memoir on the subject in hand which 
will soon see the light of day. But Mr Yule’s criticisms arise from two sources, 
(1) from his disregard of corrections which practice has taught us were needful and 
which have been known for some time, (ii) from his obvious want of that con- 
fidence in the method which arises from long experience of its applicability. 


The corrections needed are (a) those due to number of cells, and (b) the 
correction for class-index. If « = number of rows, \X = number of columns, then on 
the average of many random samples the correction for number of cells is 


* We have the following results for the small-pox data : 


Possible range of @ 
Boas-Yulean ¢ for given frequencies 


Sheffield sa ae a +531 +'9181 to —°1221 
Leicester ie nah *249 +:2806 to — +2228 
Homerton-Fulham ... = *423 +°8101 to —:2301 


How would Mr Yule compare these values of @ with each other or with those of 7 from continuous 
frequencies, which can range from —1 to +1, or again with a Boas-Yulean ¢ from such tables as 


499,200 800 500,000 499,988 12 500,000 
498,306 | 1694 500,000 499,987 13 ; 500,000 
997,506 2494 . 1,000,000 999,975 25 1,000,000 3 
p= +02, = +0003, 
Possible range +:05 to —:05, Possible range +:°005 to — -005? 


The bulk of the mental defect and blindness data considered by Mr Yule has for ¢ total possible 
ranges varying from ‘4 to ‘6 on the positive side and ‘006 to ‘004 on the negative side. How can 
the resulting coefficients be intercomparable ? 
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(« —1)(A—1)/N to be subtracted from ¢?. This is the chief but is not the only 
correction for number of cells. It is, however, the one of most importance for our 
present purpose. It must only be applied when our material can be looked upon 
as a random sample. It should not be used of course when our material 1s an 
actual theoretical frequency surface, and not a random sample from such a surface, 
The second correction is for the use of class-indices in grouping. The theory of 
this correction is discussed in an earlier paper in this number of Biometrika 
(Vol. 1x. p. 116), where it has been detached from the memoir in preparation on 
contingency in order to indicate certain fallacies in Mr Yule’s statistical theories. 


Diagram V, Maximum and minimum yalues of ¢ (or 7,,) for a 50°/, division of one category and 
various percentage divisions of the other. 
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The area inside the curved figure contains all the possible values of ¢. 


Each variate must be corrected independently for the use of broad categories, by 
calculating the correlation of the variate with its class-index. In order to test the 
efficiency of the coefficient of contingency for a variety of groupings an arbitrary 
series of groups must first be selected to work upon. We choose the groupings of 
the eye-colour data published by Pearson and Lee for father and son as being 
perfectly arbitrary groupings fixed before any controversy arose on this subject 


Biometrika 1x 28 
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and therefore clearly not selected to obtain favourable or unfavourable results. 
These frequencies are 


i |e 3 4 


\ ir | 1 
| | | 5 6 | 7 | 8 | Totals 
| p= | : 
| Father... | 36 | 322 | 264] 180 | 5 | 64 | 101 | 28 | 1000 
a 34 Ba 984° 187 |) 5. | 100.4) 798 | 41 | 1000 
| 


The reader should remember that the eye-colour groups are as follows : 


1 = light blue, 
2 = blue, dark blue, 


3 = blue-green, hazel, 


5 = light brown, 
6 = brown, 
7 = dark brown, 


4 = dark grey, hazel, 8 = very dark brown, black. 


Group 5 has been usually clubbed with group 6 because in 5 only five fathers 
and five sons occur. 


The following table gives the class-index correlations for various groupings : 


TABLE XI. 
Table of Class-Indea Correlations *. 

Order of Divisi Class Correction | Class Correction | Corrective Divid- 
Table tvig10Ns Fathers Sons ing Factor 
Ist 7x7 Te 2st ASD kOe (ests 9010 ‘9011 *8118,9110 
Qnd 7x7 | 1:2:3:4:546:7:8 ‘9624 "9645 *9282,3480 
Ist 6x6 1:243:4:546:7:8 | “9009 ‘9010 °8117,1090 
|} Qnd 6x6 | 1:2:3:4:546:748 "9542 9553 91154726 
Ist 5x5 | 1:24344:546:7:8 “8070 8352 *6740,0640 
2nd 5x5 14+2:3:4:54+6:7+8 "9259 “9296 8607, 1664 
Ist 4x4 1:24+34+4:546:7+4+8 ‘7971 *8246 *6572,8866 
and 4x4 14+2:344:54+6:7+8 9054 | 9144 8278,9776 
3rd 4x4 14+2:3:4:5464+7+8 "9156 9130 *8359,4280 
Ist 3x3 14+24+3:4:5464+7+4+8 8253 8191 *6760,0323 
2nd 3x3 14+2:34+4:5464+748 "8949 *8975 °8031,7275 
3rd 3x3 1:24+38+44+4+54647:8 “5669 5968 3383,2592 


The reader should note that while the corrective dividing factor on the whole 
gets smaller and smaller as the classes get fewer and fewer, yet it is not possible 
to assert a priori that more classes will have a higher factor than fewer classes, 
e.g. our Ist 7 x 7 table has a lower corrective factor than our 3rd 4 x 4 table, and 
our 2nd 3x3 table a higher factor than our Ist 5x5 table. This point is 
important in reference to a criticism we shall make later of Mr Yule’s statistical 
methods. 


* The justification for obtaining the class-index correlation by determining the mean by the 
Gaussian hypothesis is given later in this paper; see also Biometrika, Vol. 1x. pp. 127, 139, ete. 
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Taking this system of classification we divided up a Gaussian surface of 
‘5 correlation into the same groups, and also a Gaussian surface of ‘3 correlation. 
We publish these surfaces below *. 


TABLE XII. 


Gaussian Surface for r= "5 in Hye-Colour Groupings. 


Totals 


Xr 
Co R$ BS Go DH 
D> 


Totals 


TABLE XIII. 


Gaussian Surface for r='3 in Hye-Colour Groupings. 


54+6 He 8 Totals 


34 
301 
284 
137 
105 

98 

41 


In applying the method of contingency to these two tables, no correction for 
mean of ¢? in the random sample should be made; they are actual surfaces and not 
random samples from these surfaces. Further in order to measure what effect 
dealing only with round numbers in the cells would make we replaced the first 
table by the following Table XIV in which the decimals were cut off and a slight 
adjustment made to preserve the total variate frequencies. This table is pub- 
lished so that the reader can judge on what we worked. 


This working to units cannot be expected to give quite as good a result as 
working to two decimal places, but it is more consonant with an actual table. 


Next we took Pearson’s Family Data cards and arranged 1000 cases of Father 
and Son, first in order of magnitude of Father’s stature, next in order of Son’s 


* Mr Yule states that he has divided up the -3 Gaussian surface in a somewhat similar manner, but 
he does not publish his table, and it is therefore impossible to test his results. We should like here 
to enter a protest against this procedure, which recurs in Mr Yule’s memoir, and throws an immense 
amount of unnecessary arithmetic on any one traversing Mr Yule’s arguments. 
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TABLE XIV. 


Gaussian Surface for r=*5 adjusted to give whole units in cells. 


| of 2, 3S 4 5+6 7 8 Totals 
1 7 20 5 2 = = = 34 
2 oi 145 79 36 10 9 1 301 
eh 6 94 85 54 19 92 4 284 
J 2 32 39 31 12 yy 4 137 
5+6 = 18 28 25 11 18 5 105 
7 — 11 22 24 ie, 22 7 98 
| 8 = 2 6 | 8 5 13 7 4] 
Totals 36 322 264 180 69 101 28 1000 


magnitude of stature and divided up the frequencies of Father’s and Son’s stature 
The following contingency table resulted. 


TABLE XV. 
Stature of Father and Son in Eye-Colour Groups. 


exactly in the eye-colour groups. 


Stature of Father. 


5+6 


= 
— bo CO Orb 
WO MOT Sw 


Stature of Son. 


Totals 1000 


We have thus one table which for practical purposes is absolutely Gaussian, 
one Gaussian table modified to give units in the cells, and one table typical of 
what occurs in, perhaps, 9 out of 10 cases in every-day statistics. 


The next table gives the results obtained by the method of mean square 
contingency with the appropriate corrections. It will be seen that 3x3 tables 
give as good results as 7 x 7 tables and the method is thus justified for Gaussian 
material and for the bulk of such tables as occur in statistical practice. 


It will be seen not only how closely the mean of the contingency values 
agrees with the product-moment value of the correlation, but how little the 
individual values differ from the mean. Worse cases may possibly be found by 
those who go to seek them by extreme divisions—we have taken them as they 
came, and feel convinced by a wide experience that contingency gives in practice 
remarkably satisfactory results. 

We now take a step further and ask,if we depart from the ordinary run of cases 
and pick out skew distributions, can we place equal reliance on the contingency 
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Stature Gaussian Gaussian 
Ord emote vable Clnsses Grouped Father and Son | Surface r=0°5 | Surface r=0°3 
2nd 7x7 Ie Bey ol sae 7 ats! “49 ‘49 “30 
2nd 6x6 1:2:3:4:54+6:7+8 “49 “48 | 30 
2nd 5x5 14+2:3:4:5416:74+8 52 ‘48 “31 
2nd 4x4 14+2:34+4:54+6:74+8 52 49 30 
3rd 4x4 14+2:3:4:546+7+8 ‘51 ‘48 30 
2nd 3x3 142 :34+4 :5+64+74+8 51 “48 31 
Ist 3x3 1424+3:4:5464+74+8 "52 *5ill 30 
Mean by Contingency — *5080* “4884+ 3017 
Product-Moment Value — “5189 *5000 *3000 
Classes concentrated at 
7x7 Product Moment Gaussian means and 5231 5023 3005 
corrections used 4 { 


method to give results of practical value? By practical value we mean results 
within ‘05 of the true value of the correlation, for very small weight is given in 
practical statistics to deviations of less than this order. We cannot do better in 
answering this problem than by taking the very surfaces (some of which were 
originally selected by Pearson to illustrate extreme non-Gaussian material) which 
Mr Yule has gone out of his way to collect, for they are very far from random 
samples of average statistical experience. 


These cases are (i) a hypothetical Mendelian surface constructed by Pearson 
and noted by him as skew at the time, (11) the barometric table for Laudale and 
Southampton, (111) the ages of husband and wife, (iv) the length of ivy leaves in 
various stages of growth—all cases selected as tests by Mr Yule. 


The following Tables give the data as we have used them. It will be seen 
from this material (1) how wide is the divergence from Gaussian type and (ii) what 
a large range of diverse classifications have been used. 


There is here extreme deviation from the Gaussian type, the arrays have every 
variety of skewness from the J-shaped curve of the zero couplets to the normal 
symmetry of the four couplets arrays. The actual correlation as found by the 
product-moment method is 4. 


Now in this case there is (i) no corrective factor for random sampling as the 
Table is a theoretical table and not a random sample from such a table, (i1) there 
is no correction for class-indices because the class-indices are the actual values. 


* The Ist 4x 4 and 1st 5 x5 were also worked out and gave respectively ‘5151 and ‘5174. 

+ The slightly greater divergence from the true value here was we believe due to the adjustment 
of the table to unit frequencies. 

t The Ist 6x6 and the Ist 5 x 5 tables were also worked out and gave ‘3049 and -3089 respectively. 

§ The original table of Father and Son with 1078 entries gave r=°5140; see Biometrika, Vol. 11. 
p. 378. 

{| Class-index corrections made: see Biometrika, Vol. 1x. p. 128. 
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We have ¢?='1251373 and C,=°3335 against actual 7 ='3333. A better illustra- 
tion of the value of C, properly used could hardly be imagined. 


TABLE XVI. 


Mendelian Inheritance of Recessive Couplets on Pearson’s Theory *. 
Case of Four Couplets. 


Number of Recessive Couplets in Parent. 


o 

> 

my 

a 

ES 
8 Bo 
Sears | 
22 & 

o 2,” 
ee 

S 2 | 
50 Totals 
Z ke 


We now take: 


TABLE XVII. 
4x 4 Table for Homotyposis in Ivy Leavest. 
First Leaf. 


(1) (2) (3) (4) * 
i Under 6-95 | 6-95—-10-95 -| 10-95—14-95 | Over 14-95 | 1°58 
o 
I} (1) Under 6-95 6984 
so | (2) 6:95—10:95 25320 
S| (3) 10:95—14-95 21312 
2 (4) Over 14°95 6384 


Totals 


In the original paper Pearson gives the correlation as 562 +009. Reworking 
the original table with use of Sheppard’s correction it is found to give 567. The 
Table is extremely skew, and owing to its method of construction—i.e. taking pairs 
of leaves out of 25 gathered from each of 100 plants—it is markedly lumpy. 
The massing into a 4 x 4 table has removed this lumpiness and the result probably 
represents the organic relation between leaves of ivy from the same spray better 
than the original table. We find: 


¢? = 353,307 when corrected for random sampling, C,=°51095, and the class- 
index correlation given by r°¢,72='870,404, which leads to ‘587 for the corrected 
contingency. No investigation has been made of the linearity of the regression, 
but it seems to us that the value of the correlation in its general sense may be 

* Phil. Trans. Vol. 203 A, p. 60. Here is another illustration of Mr Yule’s peculiar methods. This 
table takes some calculating, but Mr Yule does not give the table, so that his results might be verified. 
Further any reader of his memoir would suppose that Pearson had applied on this oceasion or on 
other occasions tetrachoric 7, to tables of this character, whereas the exact contrary is the fact, Pearson 


having been the first to apply product-moment formulae to theoretical Mendelian data, including this case. 
+ Phil. Trans. Vol. 197 A, p. 351, 


bo 
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even better given by the °587 of the contingency method than by the ‘567 
of the product-moment method. Anyhow the difference is of no practical signi- 
ficance ; and we again see that contingency applies effectively to skew material. 
We next take into consideration a 7x7 table, the barometer data from 


Laudale and Southampton. 
TABLE XVIII. 


7x7 Table for Barometer Heights at Laudale and Southampton. 


Southampton. 
| 
e| =] gs] 2#/ 8] 8] 8 
eee pes ese lee S| & 
eee | eee oO | | | + | ~ @ p Totals 
3 iS = S ee || 3 S 
ao 7 7 | REA 
a1 (1) Over 30:45 50 63°75 | 225 ad = we = 116 
*S (2) 30°45—30°05 42 503°25 | 248°5 64:5 625 | — — 864°5 
3 (3) 380°05-—29°75 — 193°5 | 340 221°25 39°5 | 41°75 2°5 838°5 
4 (4) 29°75—--29°55 — 35 120°25 | 169 | 45 72°75 15 457 
(5+6) 29°55—29°35 — 4°5 49°5 117°75 | 63°25 77°25 14°75 9327 
(7) 29°35—28°95 — — 175 | «(54 | 46 102°25 46°75 | 266°5 
(8) Below 28:95 | — ss a ar l 20 305 | 52:5 
92 800 778 | 627°5 | 201 314 109°5 2922 


This is a singularly unfavourable table for contingency methods for it is a well- 
known rule in practical working to avoid cells whose actual frequencies or those 
of independent probabilities are zero or a few units. We should therefore anticipate 
getting the best results in such cases from few divisions in which cells with zero 
or small entries rarely occur. We have numbered the divisions to correspond 
with the eye-colour data nomenclature. 


Order of aes. Mean Square Southampton Laudale - 

Table PuvislOlts Genticone? Ved OF, "yCy vay 
= | 

Xtal lee2ues: 4: Ot Oki: O 69791 ‘9667 9659 “75 
6x6 1:2:3:4:546:74+8 67755 *9570 9596 ‘74 
5x5 14+2:3:4:546:74+8 *63210 9345 9324 73 
Ist 4x4 14+2:34+4:546:7+8 “60952 “9089 ‘9136 ‘73 
and 4x4 142:3:4:54647+4+8 *61368 "9238 9164 “72 
Ist 3x3 142:34+4:5464+74+8 “58899 “8979 8973 ‘73 
2nd 3x3 14243 :4:546+7+8 55020 *8450 *8215 wD) 
3rd 3x3 Extra* : Sic Bo 57530 "8823 8736 “75 
4th 3x3 Extra* ... doe are *45500 “7972 *7540 “76 
Data classified to ‘1” give Product Moment 7=-780t Mean "744 


* Two cases of this for 3 x 3-fold tables had already been worked out by Pearson in his paper in the 
earlier part of this number and are cited here ; see Biometrika, Vol. 1x. pp. 136—7. 

+ Given as °757 in the original paper, Phil. Trans. Vol. 190 A, p. 455, which antedated the publi- 
cation of the correct ‘ Sheppard’s’ corrections. 
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Here again for practical purposes the mean square contingency gives quite 
good results. It is very unlikely that in practical statistical work stress would be 
laid on a difference of ‘05 occurring in a correlation of this magnitude. 


We now turn to the table of ages of Husband and Wife taken from the Census 
of 1901. We have first arranged an 8 x 8-fold table in unequal frequency 
groups, as follows: 


Age of Husband. 


TABLE XIX. 
Age of Wife. 
(1) 15—24 | 44112 194620 4962 162 40 16 8 243921 
(2) 25—3} | 15763 | 881320] 590180 | 14046 | 2669 645 186 17 | 1504826 
(3) 35—44 873 98200 910238 | 388586 78648 13986 3362 240 ff 1494133 
(4) 45—J9 90 6584 | 85583 | 178474 | 251519 | 58699 | 12223 576 | 593748 
(5) 50—54 43 2573 27000 56976 | 146990 | 194509 53322 2027 483440 
(6) 55—59 29 1166 | 10275 | 17810| 46192 | 110115 | 176506 | 78471 369940 
(7) 60—69 16 788 6570 10233 21314 50238 | 261167 | 102356 452682 
(8) 7O— ael| 184 1257 1780 3290 6759 36251 | 125302 174830 
Totals 1185435 | 1636065 | 668067 | 550662 | 434967 | 543025 | 238366 | 5317520 
The following cases were investigated by contingency : 
: Contingency Wife Husband 
Order of ie usban 

Division uncorrected Fs . Ten 

Table cas for classes "Ct "Cy ze 

6x6 14+2:3:4:5:6:7+8 “7821 *9470 "9346 88 

Doxo 14+2:3:4:546:7+4+8 "7677 ‘9447 ‘9327 87 

4x4 14+2:3:44+5:64+7+8 "7405 ‘9312 9293 “86 

3x3 142 :34+44+5 :64+7+8 “6995 *8959 "8954 87 


The true value of the coefficient of correlation by product-moment method 
hese values in an extreme case of skewness, rendered more complex by 
heterogeneity of the material*, so far from discrediting the method of contingency 
On material given by a 3 x 3-fold or 4 x 4-fold table 


= 925. 


1 


are definitely in its favour. 


no reasoning is likely to be based which would be modified by a change of ‘03 
to ‘05 in the correlation. We think that for even these exceptional cases—which 
rarely occur without warning in practical statistics—the corrected contingency 
will not lead any one astray fF. 


* Skewness of wife’s ages=‘71; skewness of husband’s ages=-76. The heterogeneity depends 
on the fact that the table represents the ages of all married couples without regard to second or later 


marriages. 
+ Here are a few comparisons of results reached by the method of pseudo-ranks with those deduced 


by contingency : 
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What has Mr Yule to place against this method for tables 3 x 3-fold up to 
6 x 6-fold in classification? He writes: “From several trials—more than are here 
given—I have come to the tentative conclusion that the best guide to the cor- 
relation that would be found for given data, if the grouping were other than that 
which in fact it is, is the correlation for the existing grouping, provided that you 
are given at least some five or six arrays” (loc. cit. p. 618). He does not venture 
to inform us what he would do for a table of 3 x 3, or 4x4 cells! As a matter 
of fact such tables can give even by Mr Yule’s method of pseudo-ranks better 
results than the 5 x 5 or 6x6 groupings. Mr Yule is, however, not content with 
his statement that his method is tentative; before he has done with it* he has 
assumed that by increasing his classes he will approach a limit which is the true 
product-moment correlation. As a matter of fact there is no such approach at 
all; the Yulean method of pseudo-ranks may give a better result for a lower than 
a higher number of cells, and if it did go far enough to reach a limit, it would 


: 0-5 Gaussian Correlation 
Order of Barometer Results Age of Husband and Wife | arranged in Father and Son 
Table. EKye-Colour Groups 
Number 
of - = 
Classes Conti F ; 
ontingency | Pseudo-Ranks | Contingency | Pseudo-Ranks | Contingency | Pseudo-Ranks 
Ctl “15 73 —- — “49 “46 
6x6 “14 “13 88 “89 “48 “45 
5x5 73 72 “87 88 “48 “44 
ist 4x4 73 67 “86 “86 “49 *42 
and4 x 4 ‘72 “T1 -— a 48 “43 
Ist 3x3 73, 66 “91 83 48 ‘41 
2nd3 x3 273) 65 “84 81 cou “40 
3rd 3x3 ‘15 67 — = — — 
4th 3x3 16 52+ ‘87 ‘79 — — 
Mean “744 674 “890 845 488 431 
True r “780 780 “925 925 500 500 


It will be seen that the method of contingency, especially with few classes, is markedly better than 
that of pseudo-ranks. We have purposely introduced the Age of Husband and Wife, because the 
divisions there have ranges not very diverse in magnitude. In such cases the method of pseudo-ranks 
becomes almost exactly that of the true correlation of variates, if the proper Sheppard's correction be 
made, In the case of Ages of Husband and Wife, if this correction be included, we are practically 
finding the true correlation by the method of pseudo-ranks. It is remarkable that Mr Yule has not 
drawn attention to this, because it at once indicates how fallacious the method is, if the subranges 
be unequal. 

* On the basis of what he starts by calling a tentative method, he then proceeds to assert that all 
the biometric pigmentation work is “‘ wholly untrustworthy ” (loc. cit. p. 622)—a characteristic illustra- 
tion of how Mr Yule’s mind rapidly grows obsessed by a theory which he has not properly investigated. 


+ This case is of remarkable interest as indicating the futility of Mr Yule’s method of pseudo-ranks, 
The table although 3 x 3-fold has sensibly equal ranges. Therefore the correction to pass from ranks to 
variates is closely Sheppard’s. The Yulean pseudo-ranks coefficient thus corrected is raised from -522 
to ‘775, close to the true correlation ! 
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give the correlation of ranks and not that of true variates*. The fallacy of 
Mr Yule’s arguments and the extreme inferiority of his method to that of con- 
tingency will be manifest in the following comparisons. We, however, draw the 
attention of the reader to this: that if the method of pseudo-ranks did approach 
a limit it would be the correlation of ranks uncorrected for huge brackets, i.e. we 
should still have to correct for passing from ranks to variates and for class-indices. 


Comparison of Method of Contingency with Method of Pseudo-Ranks. 


Coefficients deduced from Table XV, Stature of Father and Son. Product- 
Moment Correlation =°52. 


| 
Order of Nature of Divisi Pearson’s Yule’s 
Table aE Comey roe Contingency | Pseudo-Ranks 

ese IED} Bey Stl a fasts Gi 7/ oI “49 “48 

6x6 | 1:2:3:4:54+6:7+8 “49 48 
Ist 5x5 | 1: 943-44 5546 27238 5) 37 
Qnd5x5 = 142:3:4:546:748 52 “AT 
Ist 4x4 | 142:3:4:546+47+48 5] “46 
Qnd 4x4 1:2+34+4:54+6:7+8 52 “36 
3rd 4x4 14+2:3:4:54647+4+8 51 "46 
Ist 3x3 | 14243 :4:5464+7+8 "52 37 
gnd 3x3 | 142:344:5+4647+4+8 51 “44 


The inferiority of the method of pseudo-ranks will be obvious. The contingency 
gives as good results for a 3 x 3 table as for a 5 x 5 table+; but for two different 
tables of the same order the method of pseudo-ranks will give results differing by 
as much as ‘10, ten times the difference of the contingency method. 


Here is another Tablet, to which Mr Yule has applied his method of reaching 
a limit to the actual correlation, namely that for eye-colour for pairs of brothers. 


Mr Yule having so to speak ingeniously “dressed the window” to show a 
talling correlation of pseudo-ranks with his increase of classes, then asserts that this 
pseudo-rank correlation approaches a limit below 0°28, and extending his fallacious 
reasoning holds that this limit of ranks is the limit to the true correlation of 


* The fog in Mr Yule’s mind on this subject is well illustrated by his table on p. 619. He takes 
a table for a Gaussian distribution of correlation 0°3 and says that with an infinite number of classes 
the Yulean coetficient would become 0-3. There is not a trace of any knowledge on his part that 
the limit of a process by which unit-range is given to each individual is not the same as that of a 
process by which wnit-area of the frequency curve is given to each individual. Here and elsewhere 
he makes no distinction between the correlation of variates and the correlation of ranks. In the actual 
case his limit would have been ‘2876 and not ‘3000, but far greater differences will arise if the 
material be skew. We return to this point later. 

+ Like all statistical methods, that of contingency must be used with due regard to the data to 
which it is applied and to the manner in which it is applied. Compound or heterogeneous material 
may give a contingency coefficient differing considerably from true correlation, and groupings of great 
inequality in the cells may render idle the corrective factor, e.g. if the great bulk of the material be 
placed in one or two cells. 

+ Phil. Trans. Vol. 195 A, p. 140. 
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TABLE XX. 


First Brother. 


Totals | 


Dd 
id 
ee} 


5+ 


H 2 3 | 4 


98 
833 
901 
443 
297 
297 
131 


Second Brother. 


3000 


variates, Now it is clear that there is no such general rule about the correlation 
of pseudo-ranks always moving in one direction. It is possible within certain 
limits to vary that correlation in an almost endless manner according to where 
we take our divisions. It is far more influenced by the size and position of our 
“brackets” than by whether we work with a 3 x 3-fold or a 7 x 7-fold classification. 
We can choose in this case a 3 x 3-fold table to give this spurious coefficient of 


Resemblance of Eye-Colour in Brothers. 


rder of sae Pearson’s Yule’s 
Table Nature of Divisions Contingency | Pseudo-Ranks 
exall Th he Le tay tGy Ry ato} 51 29 
Ist 6x6 14+2:3:4:546:7:8 52 29 
2nd 6x6 [eee Sate uD iOue ao 49 29 
5x5 142:3:4:546:748 “50 *30 
Ist 4x4 142:3:4:54+647+8 51 33 
Qnd 4x4 1424+3:4:546:7+8 days! 27 
3rd 4x4 142:34+4:546:7+8 "44 27 
Ist 3x3 142+3 :4:54647+4+8 BA 29 
2nd 3x3 142:34+4:5464+748 | “44 | 530) 
brd 3x3 142:3:445464+7+4+8 yO) 36 
Mean ee ae — 498 > 28% 


Mr Yule any value from ‘19 to almost ‘40. This flows from the fact that the 
class-index correction may take a wide range of values according to the arrange- 
ment of the classes, and Mr Yule makes no allowance whatever for this fact. 


* “Can we have any hesitation in similarly estimating the correlation for the eye-colour table, if we 
were in a position to adopt a finer and more uniform grouping (without assuming that we will compel 
that grouping to give us a normal distribution) as something slightly less than 0:28?” Yule, loc. cit. 
p. 619, 

+ It is not possible to correct the Yulean pseudo-ranks correlation (i) for passing from ranks to 
variates, because Mr Yule not only rejects any appeal to the Gaussian, for which we know the proper 
correction, but because his assumption of unit ranges precludes the use of that curve, nor (ii) for class- 
index correlation, because the same assumption hinders any rational method of finding the class-index 
correlations. 
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We have seen that the corrected contingency gives a good practical approach 
to the actual correlation, even if the material be skew as in the Husband and 
Wife or in the Barometric Data; are we to argue from the entirely fallacious 
reasoning of Mr Yule that the method is in this case on the average 44°/, in 
error although in the skew Mendelian table it is only ‘06°/, in error and in 
the Husband and Wife data only 3°8°/, in error? We shall need some far better 
reasons for believing that the value usually assumed for the correlation of eye- 


colour in brothers, i.e. cirea 50, is in error, than such as Mr Yule seems able to 
adduce. 


Another piece of remarkable special pleading on the same lines is that 
provided by Mr Yule in the case of Pearson’s tables for parental heredity in coat- 
colour in horses*. Those tables are remarkable in their nature, because although 
16 classes are formed, there are practically no entries except in the three main 
groups Brown, Bay and Chestnut. Here is the frequency distribution for sires, 
where bl. = black, br. = brown, b. = bay, ch. = chestnut, ro. = roan, gr. = grey: 


| 
bl. | b1./br. | br.[bl. | br. 


br./b. | b./br. | b. | b.feh. | eh./b. | eh. ch./ro. | r0./ch. ro. | r0./gr.| gr.[r0.| gr. | 


pots | 6 |< 


Now Mr Yule arranges this in 11 classes: 


| 
ae 


209 | 19 | 6 | o | 32 | 1 | o| o | | 
and also in three classes, presumably as: — 
| 0 | — | 222] —| m0 | — | 369 | — | o| — | 0 | 


He then obtains sensibly the same values for the two groupings and speaks 
of the equality of the pseudo-correlation of ranks thus obtained as marking in 
some way a limit to the correlation of the variates! Naturally he would obtain 
almost identical values, because the whole calculation of products and moments 
turns on the three dominating groups of brown, bay and chestnut and practically 
all he has done in his arrangements of three groups and eleven groups is to call 
his sub-range unity in one case and two in the other ! 


There is, we believe, only one classification possible of these tables on the 
reasonable assumption that the amount of pigment forms a continuous variate; 
namely that which makes a 3 x 3-fold division between brown, bay and chestnut 


* Phil. Trans. Vol. 195 A, pp. 122 et seq. 

+ I still see no error in my original classification by amount of pigment; there are more melanin 
pigment granules in the brown hairs than in the bay, and,—if we disregard the black chestnuts, which 
are very rare among thoroughbreds as compared with hackneys,—more in the bay than in the chestnut; 
the latter colour depends more, and in some cases almost entirely, on diffused pigment. K. P. 
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and either excludes the few roans and greys or, as done below, throws them into 
the lightest group. The threefold tables are then : 


| 
203 | 528 | 269 } 1000 


216 | 521 263 1000 


Sire. Sire. 
79 | 132 | 47 | 958 58 | 102 | a7 | 187 | 
3 | 105 | 426 | 132 | 663 So lecces| 406 J) 99, 1 571 | 
So 37, 152 190 379 = 25 121 | 146 292 | 
PS eae et fy —— 
221 710 | 369 | 1300 149 629 | 272 | 1050 | 
Dam. Dam. 
82 73 23 178. | 63 83 23 169 | 
eo) 101 319 99 519 »| 106 327 88 521 
a 33 129 141 303 ee 34 118 158 210 
| 


| 
| 


And the corrected contingency coefficients are : 


Sire and Colt: °41, Dam and Colt: °45, 
Sire and Filly: -47, Dam and Filly: °46, 
Mean: ‘45. 


Mr Yule states that the correlation is in this case of the order ‘33. Are we 
again to assert that the mean of the contingency coefficients shows an error of 
"12, or is 36°/, in excess of the true correlation, when as we have seen the error 
we find in even extremely skew distributions is of the order of 4°/, and under ? 


Mr Yule says that the correlation of his pseudo-ranks is °33. Well and good, 
then the correlation for variates would be about ‘35, probably more for skew 
variation, and the factor for correction of class-indices about ‘80, or the true 
correlation is of the order ‘35/80 ='44, which brings it strangely nearer to the 
value found by contingency than to Mr Yule’s }! 


It is quite true that the values obtained in the original memoir were higher 
than ‘45, but the theory of contingency was not then developed; when a fourfold 
table had to be formed there were very good reasons for dividing it in the way 
actually selected. In the first place the division between Chestnut and Bay was 
physiologically more reasonable than one between Brown and Bay, which would 
throw the Chestnuts into the Bays. In the next place it was the division nearest 
to the median and so liable to the least error from either random sampling or skew- 
ness. The other, the asymmetrical, divisions are less reasonable because they 
give one quadrant with only 2 to 3°/, of the total frequency in it, they divide 
parent and offspring differently, and mix in one or other case Bays with Chestnuts. 
Let us suppose with Mr Yule that these tables did show a correlation of 4 (which 
they certainly do not), then we fail to grasp why Mr Yule should not get his 
Mendelian 4 quite directly without elaborating an erroneous theory of pseudo- 
ranks and using 3 x 3-fold and 11 x 11-fold tables to show approach to a limit. 
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The Mendelian 4 should come at once by the simple division of the tables into 
Chestnut and not-Chestnut and it comes pretty closely indeed by taking only the 
Chestnut groups of the 3 x 3-fold tables, as indeed it must do if we remember the 
rarity with which a Chestnut x Chestnut gives other colours. Our four tables 


become: 


Sire. Sire. 
Totals NEC! C. Totals 
+ 921 a .| 6 126 | 758 
tS) 375 = ; 4 292 
5 79 fe : 146 
| Totals 1300 272, | 1050 
| =. 
Dam. Dam 
Totals | Totals 
S| NO 697 Bl NAC. 690 
Cae 303 Ta a aba 310 
[ is 
Totals 3 1000 


The pseudo-rank correlations, @, are : 
Sire and Colt: +3094 
Sire and Filly: 3414 
Dam and Colt: ‘3030 
Dam and Filly: 3638 


Mean = '3294 
Could a better demonstration of the Mendelian } correlation be possible ? 


Now let us look at a similar arrangement of other data in which the true 
correlation is actually known. We take the following cases with approximately 
similar total frequencies : 


From Table XV. From Table XIV. 
Stature of Father. : Gaussian Surface for °5. 
é , a : 
8 1-4 | 5—8 | Totals 
M | 
._— , 
° 1—4 659 97 756 1—4 
5S 143 101 244 =o 
5 ———_— 
S$ | Totals 802 | 198 1000 Totals 
OD 
Actual Correlation °518. Actual Correlation °*500, 


Yulean we 7308, Yulean w. 1302, 
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From Table XIII. 


Gaussian Surface for °3. 


1—4 | 5—S& Totals | 
1—4 634:97 121:03 756 
b— 8 167-03 76:97 244 


Totals | 802 | 198 1000 


Actual Correlation 300. 
Yulean sop KB 
In other words for any surface approaching the Gaussian the Yulean ¢ for a 
fourfold table with such frequencies in the total columns must be raised by ‘20 in 
the scale of correlation. Let us consider the same sort of tables in non-Gaussian 
material, material of Mr Yule’s own choosing; we have the following fourfold 
tables : 
From Table XIX. 


Age of Wife. 


x a b—f Totals 
uy SO : 
© 
os a 1,135,815 612,932 | 1,748,747 
Be b—f 110,553 | 3,458,220 | 3,568,773 
an 
| Totals 1,246,368 4,071,152 5,317,520 
Actual Correlation °925. 
Yulean .. 686. 
From Table XVI. Heredity of Mendelian Couplets. 
Father. Father. 
; Totals | Totals 
2 eee) OD | 
ail 0 1296 sal G=1 3024 
By) 1-4 2800 S| 2-4 1072 
a — SS om —EEE 
© | Totals | 1296 | 2800 ] 4096 | ©} Totals] 3024 | 1072 | 4096 
L | : 
Actual Correlation °333. Actual Correlation +333. 
Yulean we 243, Yulean see, 243; 
From Table XVIII. Barometer Heights. 
Southampton. Southampton. 
Totals | 
3 os 
S 205325 | 2276 i | 72 
S 2 2 
5 244-25 646 See 
5 — Si 
2922 Totals | 892 | 2030 | 2929 | 
Actual Correlation 780, Actual Correlation *780. 


Yulean ve OBL, Yulean w. 566. 
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Second Leaf. 


From Table XVII. Ivy Leaf Length. Whorls of Woodruff *. 
First Leaf. Members of First Whorl. 
Under 6°95 | Over 6°95 | Totals = = Under 8 | 8 and over} Totals 
bee ay ee 
Under 6:95 3358 3626 3 = | Unders 7268 2281 9549 
Over 6°95 3626 49390 = ~ |Sandover} 2281 1400 3681 
g ee 
oO fo) | SS 
Totals 6984 | 53016 = Totals 9549 | 3681 13230 
= = a eae (op) as : \ 
Actual Correlation ‘567. Actual Correlation ‘173. 
Yulean son 42: Yulean ree oa 


Now let us put together these results in a Table: 


Actual Yulean for : Percentage 
Correlation Fourfold GE Increase on } 
“925 686 239 35 
*780 566 214 38 
“780 531 "249 47 
“567 “412 "155 38 
“518 “308 210 68 
“500 302 198 66 
333 243 ‘090 37 
*300 ‘167 133 80 
173 141 032 23 


Now these results bring out the important point that whether the distribution 
of the frequency be Gaussian or not, we may have to add anything from 23 °/, up 
to 80°/,+ to the value of ¢@ as found from a fourfold table to obtain the true 
correlation. For values about °33 we may have to add anything from 37°/, to 
80°/,. For the tables for coat-colour in horses we must add to the Yulean at a 
moderate estimate something like 40°/,, which gives a value, not near the 33 of 
Mr Yule, but near our ‘46 and close to the value found by contingency or closer 
to the values originally assigned by the tetrachoric 7, method. 


If Mr Yule continues to assert that the true value of the correlation of these 
tables is $, then he may as well argue from the ¢ of the fourfold tables obtained 


* Selected as a case of irregular and skew correlation of a low product-moment value; see Phil. 
Trans. Vol. 197 A, p. 325. 


t+ One Ivy Leaf distribution as below actually gave Yulean p='278 against actual correlation 
567, requiring a 104°/, increase on the Yulean! 


First Leaf. 
3 Under 14:95 | Over 14:95 Totals 
4 Under 14:95 49562 | 4054 53616 
a Over 14:95 4054 2330 6384 
z Totals 53616 60000 
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by taking as classes Chestnut and non-Chestnut. But in that case his only 
logical standpoint is to assert that inside the Chestnut and outside the Chestnut 
there are no hereditary differences of pigment, that Chestnut is a unit character 
that differs by a unit from all other shades in Bay and Brown. This, however, is 
not a fact as a very little microscopic examination of horse hair would show him, 
Pearson’s tables themselves indicate that excluding Chestnut, there is correlation 
of intensity of pigmentation between parent and offspring; further unpublished 
material indicates that within the Chestnut and for different shades of it the like 
relation holds. If there be a correlation between parent and offspring inside and 
outside Chestnut the value for the fourfold tables in the Chestnut and non-Chestnut 
unit classes must be a minimum and not a maximum value as Mr Yule asserts, and 
his criticism collapses with the fallacies on which he has constructed it. 


The chief of these fallacies is the principle that stress of some kind can be laid 
on the Yulean, or pseudo-rank coefficient proceeding to a limit; the fact is that it 
can be made anything we please by a suitable choice of divisions. The divisions 
which for a given number of cells give it a maximum value are those which make 
the sub-range frequencies of the two variates equal. Every deviation from this 
equality lessens the value of the Yulean, whether the deviation consists in heaping 
up the frequency at one or both ends or in the middle of the variate range. 


In the following table are a few results for the pseudo-ranks coefficient for 
3 x 3-fold divisions of the Husband and Wife Table (see our p. 224). They fully 
substantiate the view that it can be made to take almost any value by a proper 
choice of grouping in the scale classes : 


Wife Groups Husband Groups en 
alee 2:3—8 1—6 : C28 | 184 
l—6n: TL Bie dis Do Bits} 254 
1—3 : 4:58 1:2—7:8 268 
1G hese 2 {3} =} 2 Bit) 2 ha} 297 
1:2-7:8 1:2-—7:8 ‘B51 
| il=ale 5 : 6—8 Ay: DFeO—o *605 
| 1—2:3—4 ;5—8 1—2 : 3—4 : 5—8 809 


The whole fallacy becomes at once obvious in the light of the class-index 
correlation correction, for even with a 6x6 or 7x7 table wide changes may 
be made owing to changes in the classification influencing the class-index 
correlation *, 


Mr Yule has selected the following Series A for Brother-Brother’s eye-colour 
in order to show that @ decreases to a limit of less than ‘28. But why should 
he not have taken Series B in order to show that it increases to a value greater 
than *28 ? 

* Cf. Table XI, p. 218. 
Biometrika 1x 30 
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Mr Yule accentuating Series A writes (loc. cit. p. 619) “The result emphasises 
the entire non-normality of the eye-colour table. For the normal distribution 


| ; 
Order of | eras f Yulean SEamEg Yulean | 
Table | Divisions Divisions 

| 

Dae, 14243 :44546+7+8 B34 14+243444+54+6:7+4+8 ‘16 
3x3 | 142:3:44+54647+48 36 14+24+384+44+54+6:7:8 ‘17 
4x4 (eo Si 46/6 BB) oe a eae ytE(G} 8 7/ 2 {8} 20 

| 419) | 3 Ae 16 eS *30 1424+344:5:6:7:8 21 
6x6 1:2:3:4:546:748 30 TOP se sy eG toy ONG} 8 7/9. {3} 25 
Coed eh Sash le selat ay Gye Gus} 29 ISR hil pels) B (5) 8 yf Sf) 28 
8x8 1g Are ec vag 53. 8 Loy Se TPES te) | 28 pai es eye be toy 2 (6) G7) 8 £8} 28 


the correlation gradually increases towards the known true value as the number 
of arrays 1s increased: with five or eight arrays, notwithstanding the extreme 
uregularity of the grouping, we have the same moderately good approximation 
to the correlation as is given by the coefficient of contingency in this case*. 
For the eye-colour table the correlation decreases as the number of arrays is 
increased.” 


The fact is we can select series of table divisions for some of which the 
Yuleans go up, for others they go down, and for still others first go up and then 
go down. That any series must ultimately reach ‘28 is obvious, because that is 
the value of the only possible 8 x 8 table, but a 9x9 table might equally well 
show ‘24, and then we suppose that taking the down series Mr Yule would have 
asserted the limit to be ‘24. But suppose Mr Yule’s data had stopped at a 
5x5 table, then according to the nature of that 5x 5 classification, Mr Yule 
might have found ‘21, or 33+ as his limit, for all his tables of lower order must 
have gone up or down to those limits! 


Mr Yule’s method if indefinitely continued would lead him to the correlation 
of ranks, but what relation this would have to the correlation of variates 1n cases 
of markedly skew variation with emphasised deviation from linearity no one at 
present is in a position to say. Some light, however, can be thrown on it by 
considering the actual deviations produced in calculating means by aid of Mr Yule’s 
hypothesis which places unit distance between each group of individuals, and 
ultimately between each individual}. Let d be the range Mr Yule assumes 
between each individual of a population n and R=(n—1)d the total range. Then 


* In this case, namely the Gaussian for -3 correlation, the Yulean gives ‘26 for both 5x5 and 8x8 
tables, the corrected contingency gives -31 and ‘30 for these tables; this is Mr Yule’s idea of the 
‘“same moderately good approximation” ! 

+ Using the division 1;:2:3:4+5+46+7:8. 

+ It is remarkable at this stage of statistical progress to find any one so incapable of appreciating 
Galton’s work on first and second prizes as to replace the equal areas occupied by individuals by equal 
ranges ! 
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it is obvious that Mr Yule’s mean character will always coincide with his median, 
or his 
M = mean = character of individual with minimum value of variate +4 R. 
re i 2 
o* = (standard deviation) = -~ (1+ —) Die 
12 n—I 
or, if n be at all large, 


Cb ee 3). 


It will be clear that for large numbers 


maximum value — minimum value of variate 
SS =r = 2/3 er cea , 
a relation entirely opposed to the practical independence of range and standard 
deviation in variates with which we are familiar*. But another difficulty at once 
arises ; 1n actual practice the subranges have all sorts of different values, and we 
may know one or more of them. Which one of these subranges is to be taken as 
the standard unit and have the range expressed in terms of it? Some numerical 
illustrations will emphasise the extraordinary difficulties, not to say contra- 
dictions, of Mr Yule’s process of treating subranges as equal units in determining 
correlation. 


For example, the Registrar-General gives ages of Husband and Wife from 15 
to about 100. Hence by Mr Yule’s method : 


Mean Age of Husband = 57°5. years, 
Mean Age of Wife = 57°5 years, 
Standard Deviation, Husband = 18°7639 years, 
Standard Deviation, Wife = 18°7639 years. 


The actual values are: 


Mean Age of Husband = 42°8306 years, 
Mean Age of Wife = 40°5838 years, 
Standard Deviation, Husband = 13:0649 years, 
Standard Deviation, Wife = 12°6813 years. 


Thus the Yulean values may differ by 30°/, to 50 °/, from the true values. 


To assume that the skew distribution is Gaussian will give much better results 
than this. Let us illustrate it on the very skew Husband and Wife, Barometer, 
and Ivy Leaf data. In dealing with actual data, we have to express the means of 
a variety of arrays in terms of a known subrange common to them all, e.g., bay 
colour in horses or hazel eyes in men. We will apply in succession Mr Yule’s 
hypothesis and the normal curve to the above data. 


* For such frequencies as occur in practice it is much safer to take the range about 6 times rather 
than 3°5 times the standard deviation. 
30 —2 
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Ages of Husbands in the Eight Groups of Table XTX. 
True Mean Age of Husband 42°8306. 


eri} 
By GAussIAN By YULEAN 
Found from 
Group . za x 7 
Value Deviation Value Deviation 
2 88°57  —4°26 46°32 + 3°49 
3 41°14 — 1°69 46°32 + 3:49 
4 40°48 — —2°35 45°66 + 2°83 | 
5 40°20 — 2°63 45°66 + 2°83 
6 40°13 — 2°70 45°66 + 2°83 
tf 4191 | -—0'92 31°32 —11°51 
7 I . | 
Mean... — 2°43 = 4:49 | 
From whole ; “67 
Range = = 57°50 +14°67 


| 
In five cases out of six the Gaussian gives better results than the Yulean in 
the simple matter of finding means even for such a skew distribution as those of 


ages of Husband and Wife. 
Now let us turn to the Ivy Leaves (Table of Breadth) * : 


By Gaussian By YULEAN 
Groups = 
Value Deviation Value Deviation 
5:95— 7:95 11°83 —1°39 14°18 +0°96 
| 1399-8390: 12°93 —0°29 11:06 —2°15 
8:95— 9°95 12°60 | —0°62 11°06 —2°15 
| 9°95—10°95 12°57 — 0°65 11°06 — 2:15 
| 10°95—12'°95 12°87 — 0°34 11°18 — 2°04 
| Mean an poe _ 0°66 — 1°89 
From whole Range = — 19°95 +6°74 


It will be seen that with one exception the true mean breadth (13'2148 for this 
very skew distribution of ivy leaves) would be substantially better found by using 
the Gaussian than by Mr Yule’s assumptions. 

Taking the barometric height at Southamptont we have, noting that the actual 
mean height is 29-9814, the results given in the table on the following page. 

In every case the Gaussian, we see, gives markedly better results than Mr Yule’s 
method. . 

We think it safe to conclude that the Gaussian can be used to give quite a 
good approach to the means of variates classed in ‘broad categories”; it is far 

* Phil. Trans. Vol. 197 A, p. 352. Breadth instead of Length taken for that character appears still 


more skew. 
+ Phil. Trans. Vol. 190 A, p. 428. 
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more adequate than the Yulean pseudo-ranks method. It must give better results 
than a process which concentrates at unit-distances and renders any attempt at 
class-index correction impossible. 


By GAUSSIAN By YULEAN 
Groups 2 - —|— —- 
Value Deviation Value Deviation 

29°5 30036 +°055 | 29°695 — ‘287 

29°6 to 29°9 30006 +°025 | 30°128 +147 
30:0 to 80°1 ee 295998 +016 30°039 + 058 
30°2 29°997 +016 30°095 +:°113 

30°38 to 80°5 30°007 + '026 29°784 — 198 
Mean ae tes = ‘028 = 161 
From whole Range — — 29°750 — °231 


It may be said that Mr Yule has not used the Yulean to find means; in 
appearance perhaps not; in actuality he certainly has, for all product-moment 
processes reduce in actuality to finding the means of arrays. In fact: 

— S (nayxy)/N — ey 
Ox Ty 
S S(nz£Yx)/N — 2y 


Tx Oy 


Here w in the summation term should be really the mean w of the individuals 
in the class nz. Thus in finding 7 we actually use the means @ and y of the two 
variates, the means ¥, of the y variate for all individuals in the class n, of w’s, and 
the mean « of all the individuals in the class nz. Mr Yule’s method, as we have 
just seen, must lead to big errors in all these means; it also as we have seen leads 
to big errors in o, and o,. Hence if 7 comes out near the true value, this can 
only arise from a compensation of errors, the exact measure of which has so far 
only been determined for the case of a Gaussian distribution. 


(11) The Hye-Colour Data. 


We now turn to the eye-colour data for parent and offspring where we think 
Mr Yule has been led into precisely the same fallacies by his method of pseudo- 
ranks as in the coat-colour of horses. The colour shades recognised by Francis 
Galton in his inquiries were: 

1. Light Blue. 
2. Blue, Dark Blue. Brown. 

3. Grey, Blue-Green. Dark Brown. 

4. Dark Grey, Hazel. 8. Very Dark Brown, Black. 

The book of data presented by Francis Galton to Pearson in 1899 contains 
under these entries a record of each family, grandparents, parents, offspring, uncles 
and aunts. No other data concerning the family were provided ;—only quite 


Light Brown. 


TD 
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recently have the original records come into possession of the Eugenics Laboratory 
and we now find they contain the important desideratum of age. It is proposed 
shortly to revise these tables of eye-colour, paying regard to the increasing pigmen- 
tation of the eye in extreme infancy and the decreasing pigmentation in extreme 
age. The inclusion of all cases, as no age was provided, has possibly something 
to do with the admitted irregularity of the tables. But does this irregularity 
invalidate the main conclusion drawn from the tables, i.e. that eye-colour pigment 
is inherited at the same rate as the measurable physical characters with a 
correlation lying somewhere between ‘46 and ‘50? The divisions made originally 
in the tables were selected purposely and with very definite ends, namely, 
(1) to give as good a physiological difference between the two groups as possible, 
(11) to put into the same class the same eyes of both parents and offspring, and 
(11) to get the least probable error by taking the divisions as near the median 
as was practically realisable. Mr Yule disregards the importance of (ii) and (111) 
and gives the values of the tetrachoric 7, found from a number of divisions, 
several of which have so few individuals in the quadrant that they are extremely 
untrustworthy. At the time the data were dealt with there was very little avail- 
able knowledge as to the distinction between a blue and grey eye. They were 
put into the same class, because it was considered that the total pigmentation of the 
iris of the grey eye was more akin to that of the blue eye, than to that of eyes like 
hazel with some macroscopic anterior pigment. The difference between the blue 
and the grey eye was considered to be one of structure rather than of pigment. 
With what we know of eyes now, we are not prepared to accept the Mendelian 
classification of eyes into those without and those with anterior pigmentation of 
the iris. If such a classification were absolutely legitimate, then grey eyes ought to 
be put with non-blue or with blue according to whether they possess such pigment. 
On such a classification as we have indicated the @ for inheritance of eye-colour 
between parent and offspring ought to be 4. Accordingly to test the position for 
the grey eye we ought to consider whether the correlations come out more nearly 
+ with the greys put with the browns, or put with the blues, ie. whether ¢ for 
a fourfold table is nearer $ for the division 1+2:3+4+4+5+4+6+4+7+48 or for 
14+24+3:44+54+6+7+8. There cannot be a moment’s hesitation as to which 
is the more strictly Mendelian division. We find: 


Values of Boas-Yulean @. 


Pair Blue only, Blue and Grey, 
a ie. 142 ewer o es 
- se 2 | | ee 
Father and Son ... a "33 37 
Father and Daughter... 29. 28 
Mother and Son ... oes 28 32 
Mother and Daughter ... *24 “34 
Mean jee <i 27 33 
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There seems no doubt that from the Mendelian standpoint, if anterior pigment 
be taken as a “unit,” then pure grey eyes should be included with the blues and 
the correlation then comes out the true Mendelian third. Thus the original 
division of the tables between grey and the hazel groups, which appeared at the 
time the most reasonable physiologically and statistically, is amply justified by the 
theory of posterior and anterior pigment. 


Now Mr Yule tells us on the basis of his erroneous theory of pseudo-ranks 
that “the average estimated correlation” of these eye-colour “ tables is something 
like + not $” (loc. cit. p. 620). We have indicated earlier in this paper (p. 232), 
that if a fourfold table for continuous variates—skew or Gaussian—give by ¢ a 


Diacram VI. Regression of eye-colour with eye-colour in brothers, the grade intervals being 
assumed to give a normal distribution. Inset, the same, colours 4 to 8 only, the grade 
intervals being of same value as before, to show resemblance within the darker grades only. 
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correlation of ‘33 this must be increased by something like 40°/., if we wish to 
find the true correlation of the variates. Had Mr Yule asserted that eye-pigmen- 
tation was not a continuous variate, that all brown and hazel eyes were alike and 
differed from blue and grey by the possession of some mysterious unit character, 
then he would have been justified by the mere fourfold tables in asserting that the 
correlation was t. In doing this he would have taken up the simple Mendelian 
position. That he did not do so was probably owing to the fact that he recognised, 
as the tables indeed show, that within the two groups, 1+2+3 the blue-grey 
group and 44+5+6+ 7+8 the hazel-brown group, there was distinct heredity of 
sub-divisions. The father with hazel eyes has an offspring less pigmented than the 
father with dark brown eyes. That such heredity exists must dispose at once of a 
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correlation of 4 as representing the heredity of eye-colour in man. And here it 
may be well to point out a very distinct difference between the problem which 
Pearson set himself and the problem for which the Mendelians profess to provide 
a solution—both speak of the heredity of eye-colour—but in no sense deal with 
the same problem*. The Mendelian says: “I treat the heredity of the character, 
presence or absence of anterior pigment.” The correlation in this case should 
be 3. This problem, however, is not that of the heredity of the various grades of 
pigment in the iris. There is very little doubt that grade of pigment is a perfectly 
continuous variate. The pigment of the iris is not solely confined to anterior and 
posterior faces, and quite different grades of pigment can occur in both these 


Diagram VII. Regression of eye-colour of brothers, the grade intervals being supposed equal. 
Inset, the same, colours 4 to 8 only, to illustrate the reduction in the regression—which 
is here the correlation—produced on the pseudo-rank hypothesis. 


Second Brother. 


(Light) 


(Light) First Brother. (Dark) 


situations, until you get in the albinotic eye almost a complete absence of pigment. 
The accompanying diagrams, which have been drawn by two different methods, 
show how the heredity of eye pigment holds as well inside the blue-grey group as 
inside the hazel-brown group. They have been obtained in two ways, (i) by the 
use of the Gaussian to obtain the means of the arrays, and the relative ranges of 


* Even in the treatment of this problem we have only the papers of Hurst and Davenport which 
have been assumed to confirm each other. Asa matter of fact on Hurst’s postulates as to the methods 
of observation, Davenport ought to have reached results discordant with Mendelism instead of con- 
firmatory! For the nature of these authors’ work on similar problems, see p. 209 above and Biometrika, 
Vol. vir. p. 403 and Vol. vii. pp. 269, 271, 272. 
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the eye-colour, (11) by Mr Yule’s method of unit groupings to indicate that even 
that method shows the same result, the inheritance of the intensity of pigment 
inside the two main groups*. In an inset figure we have given, by the same 
methods, the increasing pigmentation of the second brother as the pigmentation 
of the first brother increases, when we remove all the blue-grey group from con- 
sideration. The general weakening of the correlation produced by using the 
method of pseudo-ranks will be obvious if Diagrams VI and VII are compared. 
The later diagrams, Nos. VIII and IX, show precisely the same point for the 
parental eye-colour data. It will be clear that if 4 were the limit to the corre- 
lation of these eye-colour tables, then all the correlation within the blue-grey and 
hazel-brown groups ought to have disappeared. Instead of this we find that the 
z 18 only the correlation on the Mendelian hypothesis that there is a unit difference 
between individuals in the one group and individuals in the other; whatever 
this “unit difference ” 
in pigmentation, because there is correlation within the groups. 


may refer to, it does not refer to a quantitative difference 


We have then the following results for fourfold parental eye-colour tables, the 
division being made between marked and less marked anterior pigment : 


Variates supposed Variates supposed to be 
Classes to be discrete, continuous and Gaussian, 
Yulean @ Tetrachoric 7, 

Father and Son ... a0 37 “1545) 
Father and Daughter... 28 "44 
Mother and Son ... is "32 “48 
Mother and Daughter ... "B34 “D1 
Mean a cs 33 *A9 


Mr Yule’s statement that the correlation is “something like 4, not $” amounts 
to the denial that the variate is continuous; and the slightest inspection of our 
diagrams shows that the variate is continuous; the question therefore turns on 
whether the Gaussian assumption gives a reasonable approximation to the influence 
of this continuity in increasing the correlation. We have shown that the fourfold 
table result (fp) obtained by treating the variate as discrete requires, whether the 
distribution be skew or Gaussian, to be increased by amounts ranging from 37 °/, to 
80 °/,, when the value of ¢ is °3 or upwards. We have accordingly little hesitation 
in asserting that the true correlation exceeds 1 by at least 40 °/,;. Another way 
of approaching the problem is the one adopted by Pearson, when he found the 
want of stability in the tetrachoric 7, as applied to these eye-colour tables, the 
method of mean square contingency. 


* Note how the slope of this regression line is *5 for the main portion of Diagram VI. 
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Making the proper corrections we find: 


Order of Father Father Mother Mother 
Tabl Nature of Classification and and and and 

a Son Daughter Son Daughter 
Ue ley so Bh oa Sith 3.76 Bf) ‘b7 48 50 43 
6x6 1:2:3:4:5+6:7+8 58 50 “48 44 
5x5 142:3:4:546:748 755) 52, *5O 44 
Ist 4x4 142 :34+4:546:74+8 53 *50 "45 41 
29nd 4x4 142:3:4:54647+4+8 54 56 “50 45 
Ist 3x3 142 :3+4:5464748 52 “A5 45 “41 
2nd 3x3 142438 :4:54647+8 ‘60 ‘Bl 56 *5O 
Mean* = BD 50 “49 "44 
Values originally given by Pearsont 55 “44 “48 ‘1 


That the eye-colour tables present anomalies has been fully admitted, but the 
average mean square contingency based on 28 groupings of these four tables gives 
the mean value ‘4961; the mean given by Pearson originally from what he con- 
sidered and we still consider the natural fourfold division was 4947. We have 
no experience whatever of the average of a large number of corrected and wholly 
@ priort unselected contingencies giving a result too high by 33 °/, of its value! 
On the contrary, as we have shown on pp. 221—226 the contingency method 
appears on the average to give values slightly in defect of the true correlation. 


A study of our diagrams shows that it is in the very small groups 1 and 8 
containing only 25°/, to 45°/, of parents that the marked deviations from 
linearity of the regression occur, that is to say in the light blue-eyed and very 
dark brown parents’ offspring. We suggest here that a considerable number of 
the light blue parents may have been erroneously classified, on account of extreme 
age, and a considerable number of the offspring of very dark-eyed parents may 
have been classified as light blue because of extreme infancy. Both may really 
belong to the mediocre classes. It is only in the extreme classes of small fre- 
quency that the effect of this shifting of the mediocre would be sensible. The 
Diagrams IX and X (pp. 248 and 250)+ show that in the bulk of cases, from group 
2 to 6 or even to 7, the regression line has a slope of over ‘5; the inset figures 
show that this regression is maintained inside the brown group when we remove 
the blues. We have already given similar diagrams for the Brother-Brother table. 
These again show that in the centre of the figure the slope of the regression line 
is at least °5, while the defect at the tails very probably indicates the results of 

* Calculated from the corrected contingency to four decimal places. 

+ Phil. Trans. Vol. 195 A, p. 106. The division taken is that which for the use of a single division 
only is physiologically and statistically the most reasonable ; see above, p. 238. 

{ The Diagrams VI and IX were obtained by applying a normal curve to each array to obtain 


the position of the mean on the assumption that the range of groups 3 and 4 remains the same on 
the scale, 


vis 
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including cases in which one brother was an infant. The second diagram (p. 240) 
worked out by Mr Yule’s pseudo-ranks indicates also the heredity within the 
brown group when blues are excluded, but it obviously distorts the whole system 
and gives the spurious appearance of far less correlation. 

Another manner of illustrating the idle character of Mr Yule’s assertion that 
the true parental correlation is about 4 may be obtained from a comparison 
of the following tables : 


Totals 


335 
284 
137 
105 
139 


Totals 


N 
ar || 
cs) 
Oo 
eS 
Nn 
ak 
> 
~ 
+ 
oo 


B 


LR, USAR se rkO 41 9 21 335 
3 83 124 4] 13 23 284 
4 25 34 55 | 11 12 137 

5+6 27 12 19 24 23 105 

7+8 29 24 24 12 50 139 


Totals | 358 264 180 69 


C 1+2 3 4 5+6 | 7+8 | Totals 
| 
1+2 193 84 38 10 10 335 
3 100 85 54 19 26 284 
4 34 39 31 12 21 137 
5+6 18 28 25 ial 23 105 
7+8 13 28 32 17 49 139 


Totals | 358 | 264 | 180 | 69 | 129 | 1900 


Now the changes that will convert A into B are given by the scheme: 


B-A 142 3 4 5+6 7+8 
142 +32 -17 = ih - 7 = 1 
3 -—19 +46 -ll — 6 -—10 
4 -—16 -— i: +27 0) - 8 
5+6 Oo | -16 -— 3 15 + 4 
7+8 +3 #—-10 — 6 - 2 +15 


That is to say to obtain B from A we must take between } and } of the 


material of A and transfer it to the diagonal cells from those cells away from the 


diagonal, the only failure of this rule is the +3 in the cell 1+ 2:7+8. There 
31—2 
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can be no doubt that the correlation of Bis very much greater than the correlation 
of A. To convert @ into A the changes are given by the scheme: 


B30 \ per? 3 | p Bag ees 
——— a 

14+2 + 1 — 14 + 3 - il +11 

3 2297 | Seatage Sle) Mags cl hernia emeaaes 

J =\G) (|) =) Bel WEode Ae le eG 

BiG 1) eo een TG | 5G mers 0 

Fie a) eae. | eonadee eas 5a ape eal 


Here again the candid reader will, we believe, admit that B has more correla- 
tion than C. The general difference is again a transfer to the diagonal cells of 
frequency to the right and left of this diagonal, the total number so transferred is 
78; but the transfer is rendered more complex by the appearance of 36 units in 
the cells in the top right-hand and bottom left-hand corners. This transfer is 
only half the size of the other towards the diagonal, and it probably measures 
the frequency of senile blues in the parents and of infantile blues in the offspring. 
We hold that we must still credit B with more correlation than C, although this 
secondary movement can now be well recognised. JB is the contingency table for 
Father and Son’s eye-colour (see p. 186). A is a table for a Gaussian surface with 
‘3 correlation, C is a similar table for a Gaussian surface with ‘5 correlation, both 
being adjusted to give frequencies to unit places only with the same marginal 
totals as the Father and Son eye-colour table. These comparisons suffice to 
indicate that the correlation of Father and Son is far greater than ‘30 and is 
probably slightly greater than °50. 

Precisely similar treatment of the Brother-Brother eye-colour table leads to 
like conclusions, as the reader may judge from what follows. 


Consider the three tables : 


A 14+2 3 4 5+6 | 7+8 | Totals 
14+2 931 
2 901 
J 443 
S+6 297 
74-8 428 
Totals 3000 
B ete, 3 4 5+6 7+8 | Totals 
142 | 496 | 294 63 68 80 931 
3 924 | 418 97 56 | 106 901 
J 63 97 | 168 47 68 443 
546 68 56 47 70 56 297 
748 80 | 106 68 56 | 118 428 
—————— SSS. el 
Totals | 931 | 901 | 443 | 997 | 428 | 3000 
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5 | T+8 | Totals 


40 931 
Sr 901 
76 443 
66 297 
149 428 


428 3000 


Now mark what change must be made in A to correct it to B,; this is 
given by the scheme: 


BoA 2 eal ae 5+6 | 748 
| 
142 +98 | — 59 ~ 50 42 +9 | 
3 —59 | +137 ~38 —31 ~9 
4 -50 | — 38 | +96 | —- 3 ht 
SEG wei Ok = BI ee em || 
Peed: | a= 36 a; -2 | +7 | 


Can there be any doubt that the scheme B— A marks immensely increased 
correlation? To pass from A to B, we must accumulate 372 individuals along the 
diagonal as against 11 individuals drawn towards the corners away from this 
diagonal. We take it that only the most captious person could possibly deny 
that to reach B from A, we must transfer individuals in a manner which markedly 
increases the correlation, 


Now consider the change which must be made in C to obtain B. It is given 
by the scheme: 


B-C fea 3 4 546 7+8 
1+2 +25 — 54 = 83 +20 | +40 
3 45 ee TS) = 43 —30 4) 
4 —3l — 43 +90 = is] — 8 
5+6 +20 = 310) — 8 5 —10 
T+8 =27(0) || a5) —-8 | -10 | -31 


To pass from C to B we must transfer 261 individuals to the diagonal; but 
there is an outward movement of 69 to each corner and a resulting defect of —31 
in the fifth diagonal cell. There is a total movement of 261 individuals towards 
greater, and of 169 towards lesser correlation. We should have no hesitation in 
saying that the correlation of B is higher than that of the C table. We would go 
further and say that until we find a table D in which these movements towards 
the diagonal and towards the outer corners nearly balance we have not yet reached 
a table with a correlation equal to that of B. Consider the table: 
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D 14+2 3 4 5+6 | 7+8 | Totals 
931 

901 

443 

297 | 

428 | 

3000 | 

We have the scheme: 

B-D 142 3 4 54+6 T+8 
1+2 + 2 — 51 —-25 | +26 +48 
o —51 +110 —45 — 30 +16 
4 —25 — 45 +87 — 9 -— 8 
| 5+6 +26 | —- 30 | — 9 | +426 —13 
T+S | +48 + 16 = 8 | —13 — 43 


Here 225 individuals must be transferred to the diagonal to get from D to B, 
i.e, there is an increase to this extent of the correlation, but 223 have got to be 
transferred outwards to the ends of the other diagonal, although not in such a 
concentrated fashion. There can be small doubt, we think, that the correlation of 
B is nearly, if not slightly greater than, that of D. B is the correlation table for 
Brothers, A, C and D have correlations respectively of ‘28, °45 and 50. We feel 
confident in asserting that the unknown correlation of B is far nearer to that-of 
C or D, than of A, which Mr Yule gives as its limit! Nor can the reader who 
examines these tables fail to obtain true insight into the peculiar nature of the 
correlation of B. It is clear that B fails of normality because there is an excess of 
dark brothers having brothers with blue eyes. The excess is greater than in the 
case of father and son and this is actually what we should expect, if the excess be 
due to the inclusion in the record of infants. For an infant will be reckoned 
once as a son, but if he is one of n brothers, he will appear n(n—1) times. If 
our surmise be correct there may be no failure at all of approximately Gaussian 
frequency in these eye-colour tables, but anomalous lumps in the second and 
fourth quadrants due to this unconscious inclusion of infants. If these lumps, 
which the inclusion of a dozen to twenty infants would suffice to produce, be 
removed, we should expect the correlation to rise as in the Huxley Lecture to 
about ‘60. What the final value would be after the correction for the more 
gradual changes in eye-colour, which occur after infancy*, remains to be deter- 
mined; we may be quite certain, however, that it is likely to be 80 to 100°/, 
above Mr Yule’s maximum of °28. 


* The influence of age on parental and fraternal tables of eye-colour will be shortly considered 
de novo. 
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But let us look at the matter from other standpoints. We have the stature 
data for Father and Son arranged in the eye-colour groups. Let us arrange it 
in a fourfold table, precisely as we did the eye-colour in order to demonstrate 
Mendelism, i.e. 

Stature of Father. 


3 | Totals 

M 

ey & | 
STO os ok. 619 | 
| £45+6+7+8 381 
is —E 
8 | Totals 1000 | 
D | | 


We may give these a class-name, say, Short and Tall—corresponding to Short 
Muzzle and Long Muzzle, or to “Over 30” and “Under 30” ege hens, ete., ete. 
We have at once a Mendelian table: 


: —— 
Short Fathers | Tall Fathers Totals 


Short Sons 619 
Tall Sons 381 
| 


Totals 1000 


Now assume a discrete unit between Short and Tall and find the Boas-Yulean ¢. 

It is 
go = °335, 

and as its Mendelian value should be °333, the agreement is extraordinary! Tall- 
ness and shortness in man are clearly Mendelian presence and absence of a unit 
character! The eye-colour tables at the same divisions show exactly the same 
result. Surely Mr Yule is satisfied that the correlation of stature in man is of 
the order 4? Yet the tetrachoric 7, in these cases at the same division gives for 
eye-colour ‘55 and for stature ‘51, 1.e.is some 60 to 70°/, greater. Now let us 
treat these two cases by an identical process; we use a normal horizontal and a 
normal vertical scale, and we determine all ranges in terms of the range of the 
frequency of groups 3 + 4 treated as of length h. We find the mean values of the 
arrays of sons for each group of fathers by assuming that the means will be 
approximately given if the array be treated for this purpose as Gaussian (see 
p. 236). The results are given in the table on p. 249. 


Can any reader who examines these data deny the remarkable parallelism of the 
two cases? Let him also look at the Diagrams VIII and IX (p. 248) drawn for 
stature and eye-colour and ask himself whether it is possible to make any distinc- 
tion between the cases of inheritance of eye-colour and inheritance of stature, 
beyond the irregularity depending on the group of 36 cases (3°6 °/, of the whole) 
of dark blue-eyed fathers. The two regression lines run with almost complete 
agreement, the stature data being a little more regular than the eye-colour data. 
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Diacram VIII. Regression in Stature. Father and Son. Based upon means obtained by 
Gaussian assumptions from graded but unmeasured stature groups, equal to the eye- 


colour groups. 


8 


(Tall.) 
~] 


bt6 


Son. 
ide) 


(Short. ) 


(Tall.) 


(Short.) Father. 
Regression in Eye-colour. Father and Son. Based upon means obtained by Gaussian 


Dracram IX. 
assumptions. To be compared with the like regression for stature. 


8 
a 
2 
Q 536 
4 
a 3i 
@---. 
=e) 
3 5+6 
1 <== 


(Light.) Father. 
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Mean Son Eye-Colour | Stature 
Group 1 ee ix pee — *4336/ — *8255h* 
~ 2 aes het ee DOGD/ — ‘4778h 
5 3 — ‘0166; — *0246/ 
7 4 an fe ane + *1319/ + ‘2686h | 
» O+6 oe an jos |) ae OY + 611k | 
. i | + *7513h + *7358h 
x 8 +1:1414h +1:0455h* 
Standard Deviation oc, re oy oe *8247h 8247h 
” a Os int Sar aoe *8932h *8932h 
n uncorrected from above... aes as 5217 5074 
7 corrected eae me fae 534 RC 5420 5224 
Slope of Regression Line, from 7 used as 7... | “5870+ 5658 
True value of r ate ae a ae ) 5189 
Tetrachoric 7+ ae Set te an 5503 5104 


Can any one continue to assert with Mr Yule that the correlation for eye-colour is 
about 4 and that for stature is 4? Is it not clear that for eye-colour in Father 
and Son the value given by Pearson in 1900 is within ‘02 of the true value ? 


A similar figure (Diagram X) is given on p. 250 for Mother and Son. This is more 
irregular in the terminal groups (which contain 3°6 °/, and 4°6°/, of the frequency 
only), but it shows the same points. The uncorrected 7 ="4930, the corrected 
» = 5109 and the regression is 5459. These are in quite good agreement with 
the value of r as found from contingency, i.e. ‘4885 (see p. 241), but differ slightly 
in excess from the value given by the tetrachoric r, for the “ Mendelian” table, 
ie. correlation ‘4817 and regression 5145, It is clear from the diagram that 7, 
as regards 7, has been lessened by the deviations from linearity in the terminal 
classes§. Still the slope of the regression line shows that we are far from dealing 
with a correlation of + for there is approximate equality in the variations of the 
eye-colour in mother and son, o,,='8574h and og =‘9161 h. 


(12) The Vaccination Data. 


We now turn to the question of vaccination and recovery from small-pox. 
Mr Yule apparently considers either his coefficient of association, or his coefficient of 
colligation, to be the right coefficient to use here. Now both these coefficients are 


* The means in the case of the two terminal arrays had to be somewhat differently treated as there 
is no frequency in the first array of groups 5—8 and in the last array of groups 1+2. Accordingly if the 
h of the table be 23,4, the mean of the first array was found in terms of hg, i.e. the range of group 2 
for that array, and the mean of the last array from hs4¢,7 for that array, i.e. the range of groups 5+6+7 
for that array. ho and his,gi7 were then expressed in terms of h3,4, ie. of h of the table by means of 
the relations between these subranges in the marginal total frequencies for sons. 

+ To four figures the value as given in 1900 was ‘5159. 

~ For divisions as originally given in the Phil. Trans. memoir, Vol. 195 A, p. 106. 

§ See remarks, p. 164 above, on the true measure of correlation. 
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unchanged by the artificial selection in which Mr Yule sees* “a most important 
property and one of special importance in such cases as those I have chosen for 
illustrations,” i.e. vaccination and recovery. On the other hand the ¢, or pseudo- 
rank correlation, is for a fourfold table immensely influenced, as we have shown, by 
selection of this artificial kind. If this ¢, however, be artificially selected, so that 
it deviates widely from the ¢ of the data actually provided, then this artificial 
value of ¢—for a table, say, in which the number of vaccinated has been made 
equal to the number of unvaccinated and the number of deaths equal to the 
number of recoveries—is Mr Yule’s coefficient of colligation. From our stand- 
point it is hard to conceive a stronger argument against this coefficient of 
colligation than the fact that it is unequal to @, unless you have artificially 


Diagram X. Regression in Eye-colouw. Mother and Son. Based upon means obtained 
by Gaussian assumptions. 


aes Pe FT a 


Ss | eres 

8 ; PORE 

= OkO | / »s 
cet | LE 


Son. 


(Light.) 


1 2 3 4 586 Ten 8 
(Light.) Mother. (Dark.) 


doctored @ to bring the two into agreement. What is the probable error of 
¢ thus doctored or how is it related to the probable error of @ found from 
the undoctored material? Mr Yule, however, writes: “We have therefore the 
important theorem briefly mentioned without proof in p. 17—the coefficient of 
colligation w for any table is the product-sum correlation r+ for the equivalent 
symmetrical table. These two coefficients r and w form, accordingly, a natural 
pair, the first giving the actual correlation in the given table, the second the 
correlation in a derived table of standard form, thus enabling us to compare the 

* Loe, cit. p. 587. 

+ Mr Yule here as elsewhere terms ¢ the ‘‘ product-sum correlation” and uses for it the letter r. 


This is wholly unjustifiable, it is merely a correlation of pseudo-ranks and not of true variates 
at all. 
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two tables freed from the effects of ‘selecting varying proportions of A’s and B’s’” 
(loc. cit. p. 597). The italics are Mr Yule’s. We have rarely come across more 
specious reasoning. <A coefficient is selected which for one type of artificial 
selection is not changed, and this peculiarity is termed a special and important 
feature of the coefficient. Another coefficient which is intensely subject to this 
selection is then commended because it can be selected so as to agree with the 
first; and Mr Yule terms them a “natural pair.” Algebraically Mr Yule starts 


with the table ong and the Q for this is (ad —bc)/(ad + bc). This value of Q) 


e| da’ 
remains unchanged if we write the table are where the marginal fre- 


quencies have now been rendered artificially all the same. The original @ however 
was 


(ad — be)/V(a + b) (a + c) (b+ d) (c+ d), 
and has changed to 


(Vad —Vbc)/(Vad + Vbc), 


which is Mr Yule’s colligation and a function of Q. The original ¢ has many 
important properties, it is Pearson’s mean square contingency for a fourfold table 
and determines the probability of the two variates being independent; it is also 
the correlation of means if they be measured from their dividing lines in terms of 
their standard deviations, supposing the material continuous and approximately 
normal. The new ¢ possesses neither of these really important properties, it is 
the ¢ of the table after most artificial selection, and that it agrees with Mr Yule’s 
coefficient of colligation neither gives validity to that coefficient, nor endows it 
with any new property whatever. 


Mr Yule has not taken the trouble to see what sort of effect his artificial 
selection really has on actual material. We propose to illustrate it on certain 
vaccination data. We give, as Table X XI, the table of Glasgow data published in 
Biometrika, Vol. vit. p. 257. It is not true that the haemorrhagic and confluent 
cases all die, but a very large percentage of them die and it would be not far 
from the fact to represent the data by a fourfold table : 


Deaths | Recoveries Totals | 


Vaccinated : 
Unvaccinated ... 


Totals 


This table is only illustrative, not, of course, a rigorous experience. 


Now we have not assumed Table XXI to be a Gaussian distribution, but we have 
found the means of the arrays by assuming each to be represented by a Gaussian 
curve, a process which, as we have seen, gives a fairly close result even for skew 


material. We have also, in order to get scales of a reasonable character, arranged 
32—2 
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TABLE XXII. Severity of Attack and Strength of Immunity due to 


Vaccination City of Glasgow Hospital. 
Severity of Attack. 


, = Haemorrhagic | Confluent | Abundant Sparse | Very Sparse | Totals 
= 

ie 0—10 30 
O'n 10—25 457 
ES Q5—45 932 
fe, 1S Over 45 155 
° & | Unvaccinated 115 
oe 

me) 

S Totals 1689 
Ss 


Diicram XI. Regression of the intensity of vaccination upon severity of attack and of severity of attack upon 
intensity of vaccination, showing the continuity of the relationship. Measured on Gaussian scales. 


0-10 


10-25 
ae Sa) oa ce 


25-45 


Intensity of vaccination measured in period since vaccination. 


Unvac- 
cinated 


| 


Mea of -groufe 


Haemorrhagic. 


Confluent. 


Abundant. 
Severity of Attack. 


Sparse. 


Very Sparse. 


Kart Prarson AND Davin HERon 253 


the severity of attack and immunity due to vaccination on Gaussian bases. This 
is done in Diagram XI, and the corresponding regression lines are drawn. These 
were calculated by supposing the range of the mid or ‘abundant’ group the same 
for all arrays of immunity, and the group 25—45 the same for all arrays of 
severity. It will be seen at once from this diagram that the division into 
vaccinated and unvaccinated is a merely verbal one, that Mr Yule is playing with 
words, not dealing with the realities under class-names*. Immunity and severity 
are continuously changing quantities and they vary in almost linear relationship to 
each other, the greater the intensity of the immunity associated with vaccination 
the less severe is the attack. We do not think the deviations from linearity 
of regression found in this case differ sensibly from the sort of values that 
frequently occur in the case of the regression line found by the product-moment 
method. 


In order, however, that Mr Yule may not attribute this result to the use of the 
Gaussian distribution to find the means, we applied his method of pseudo-ranks to 
determine the means of the arrays and then put in the regression lines on his 
hypothesis that the rank of a huge bracket is a suitable unit in which to measure 
correlation. This is done in the lines A of Diagrams XII and XIII. Although 
the previous scheme is much distorted, we see that on Mr Yule’s own hypothesis 
the two variates behind the “ vaccination—non-vaccination” and “death—recovery ” 
classifications are continuous, steadily increasing in one direction, and that there 
is the correlation of a continuous two-variate system behind them. 


Now let us proceed to do exactly like what Mr Yule has done, namely make 
divisions at vaccinated and non-vaccinated, and between confluent and abundant, 
and dress the table so that there shall be equal numbers of vaccinated and un- 
vaccinated and of the very severe and less severe cases. The table thus transformed 
is Table XXII. The difficulty now is to know what to do with the marginal scales ; 
if these in the natural condition were normal, they are hardly likely to remain so 
after selection. But these scales—especially as we have found the correlations. by 
the 7-process—are not of first class importance except for exhibition of the results 
graphically. We have therefore retained the old scales and merely calculated the 
means of the arrays condensed at the old means of the old scale of years since 
vaccination (see Diagram XIV). In order that the reader may see how little 
assumption is thus made we have had a second diagram drawn indicating what 


* «¢The division may also be vague and uncertain: sanity and insanity, sight and blindness, pass 
into each other by such fine gradations that judgments may differ as to the class in which a given 
individual should be entered. The possibility of uncertainties of this kind should always be borne in 
mind in considering statistics of attributes: whatever the nature of the classification, however, natural 
or artificial, definite or uncertain, the final judgment must be decisive; any one object or individual must 
be held to possess the given attribute or not.” Theory of Statistics, pp. 8, 9. Similar words are used 
by Mr Yule in Biometrika, Vol. 1. p. 121: ‘‘The judgment must however be finally decisive ; inter- 
mediates not being classed as such even when observed.’’ We can hardly conceive statements more 
liable to prejudice the mind of a wavering recorder of actual data. No wonder Mr Sanger said that 
““Mr Yule’s work would be the work for Mendelians” ! 
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TABLE XXII. Immunity and Severity of Vaccination Table after a Yulean 
Equalisation Process. 


Eerod ane Haemorrhagic | Confluent | Abundant Sparse Very Sparse Totals 
2°8 so 5°6 
52°8 76°4 63°0 
138°5 124°1 83°8 
22°2 15°3 130 
198°4 33°99 | 9°6 
Totals 76°9 767°6 | 414°7 254°8 175°0 


MA aes —— —— 


= ee ‘ 
844°5 844°5 

Diagrams XII anp XIII. Intensity of Vaccination and Severity of Disease treated by the method of 

pseudo-ranks. 
Assumed unit grades. (A) original data, (B) Haemorrhagic and Confluent made 50 °/, 
and vaccinated made 50°/,. 
(i) Regression of intensity upon severity. (ii) Regression of severity upon intensity. 
Dracram XII. 
(1) 
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Diacram XIII. 


(ii) 

0-10 

= 10-25 
S 

g 25-45 
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S 
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cinated 


Haemorrhagic. Confluent. Abundant. Sparse. Very Sparse. 
Severity of Attach. 


changes would have been made had we used as horizontal scale a Gaussian calculated 
on the new distribution of total severities. It will be seen on comparing Diagrams 
XIV and XV that no substantial difference is made in the fundamental result. 
Further to test the effect of concentration we have had 7 the correlation ratio 
calculated by concentration at the means of the unselected scale of years since 
vaccination and by treating each array as a Gaussian; the value found for it in 
the former case (when uncorrected for class-index correlation of severity) 1s ‘32, 
while in the latter case the three-rowed table gave » = ‘32 and ‘34, with a mean 
value °33*. Both these values would need the same class-index correction for 
severity, 1.¢. 7,¢ = "9502, which renders them ‘34 and ‘36 respectively. 


Further, to show that a wide range of hypotheses can be made without 
modifying the fundamental conclusion, we have again plotted the regression lines 
on the Yulean hypothesis of pseudo-ranks before and after equalising deaths and 


* Biometrika, Vol. vit. p. 257. The value of r found by concentrating at the Gaussian means of the 
marginal subranges is ‘34, quite close to the 7 values as corrected. 
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recoveries, vaccinations and non-vaccinations (see Curves B of Diagrams XII and 
XIII). The fundamental result is obvious in all these diagrams, the selection, 
which does not change Mr Yule’s coefficients, changes widely the true relationship 
as measured by the regressicn lines between the variates immunity conferred by 
vaccination and the severity of small-pox. Take a given grade of severity of the 
disease, and the grade of immunity which is associated with it is entirely altered by 
the selection, or take a definite grade of vaccination and the average severity of the 
disease is represented by quite different figures before and after selection. Mr Yule’s 
most important property of his coefficients—the property that they remain 
unaltered by this selection—is the very property which forces us to the conviction 
that they are wholly unsuited to use in such a case as that which he asserts is 
eminently fitted for their use. 


If we consider the matter algebraically, we find: 


0 Ta 
=n | Nl Yulean ¢ Contingency 
Rpcourzeted Corrected | Pucomecia Corrected | 
- | | =| = 
Before selection | 32 to °34 ‘34 to ‘36 26 34 BHI 36 
After selection “58 | 61 52 | ‘60 “50 58 
| 


We contend that a method which so substantially changes the real relationship 
between immunity and severity is wholly incapable of leading those who use it to 
any just inference as to the association of immunity and severity. It is only a 
logical quibble quite unworthy of the reputation of the man who uses it, and 
directed at an audience which had made no thorough study of the mathematics 
of statistics, to state as Mr Yule does that a man dies or does not die after 
incurring small-pox, and that he is vaccinated or not vaccinated. Death and 
vaccination are crude class-indices of severity and immunity, and Mr Yule’s 
coefficients tell us nothing of the real relation between the two variates which 
is what we at least are seeking. 


(13) On the Stability of Coefficients of Association. 


If we have, as we believe we certainly have, in this paper succeeded in 
demonstrating the idleness of Mr Yule’s coefficients of association and colligation 
for any purpose of applied statistics; if, as we hold, they are merely of interest 
from the standpoint of symbolic logic, i.e. in the discussion of verbal classifications, 
not in the treatment of the real things lying behind class-indices ; the reader may 
naturally ask how we propose to treat similar problems. The answer to our mind 
is fairly clear. When the variates truly advance by unit grades, then there is no 
difficulty whatever about the problem. The right method to use is the product- 


* Calculated by concentrating the contents of each cell at the Gaussian centroids of the subranges 
of the marginal totals and correcting for class-index correlations ; see Biometrika, Vol. 1x. p. 119. 


Kart Prarson AND Davip HERON 257 


moment correlation. This method has been used from the “earliest times” by 
biometricians, e.g. in dealing with the teeth on the carapace of prawns, the 
prickles on holly leaves, stigmatic bands on poppies, veins on leaves, etc., etc. 
There is absolutely nothing new in the Yulean method of pseudo-ranks applied to 
these data, the rank is the true measure of the variate in such cases. But the 
method is wholly false when applied to continuous variates in such cases as 
Mr Yule applies it. It is wholly false in particular when applied to fourfold 
tables, unless the difference between the two classes is a measurable unit. This, 
as we have indicated, is not the case in the Mendelian results of actual practice ; 
it is not the case in the vaccination data or in any one of the cases to which 
Mr Yule has applied his coefficients. In no cases is he dealing with discrete unit 
differences. The nearest approach to a discrete difference is possibly in the case 
Diacrams XIV anp XV. Regression of Intensity of Vaccination upon Severity of Small-pox. 
(A) Original data. (B) Vaccinated made 50°/, and haemorrhagic and confluent 50°/, 


(Yule’s Hypothesis). Intensity of Vaccination and Severity of Attack treated by Gaussian 
Methods. 


(i) Assuming ‘‘severity” has a Gaussian distribution after the change of frequencies. 
(ii) Assuming the centroids of each severity group unaltered by the change of frequencies. 
Diacram XIV. 
(1) 


— 


O=10 


Years elapsed. 


Intensity of Vaccination. 


an 


Haemorrhagic. Confluent. Abundant. Sparse. Very Sparse. 
Severity of Attack. 
Biometrika rx 3h} 
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Diacram XV. 
(1) 
0-10) a 


Years elapsed. 


Intensity of Vaccination. 


cinated 


Haemorrhagic. Confluent. Abundant. Sparse. Very Sparse. 


Severity of Attack. 


of sex, or in the use of a spear or a sword in negro tribes as alternatives, although 
even in these cases itis hard to understand why the male is a unit in excess or 
defect of the female or why a spear is a sword plus or minus a unit. But if we 
admit this, then @ is the appropriate coefficient to use, and not Mr Yule’s 
association or colligation, and it should be applied to tbe original data and not 
to the adjusted or equalised table. Personally we should never use ¢ in such 
cases; we should measure the probability that the variates were independent, 
ie. deduce the P from ng? by Elderton’s tables, and this would guide our judg- 
ment in the matter *. 


* We can get rid of the main effect of influence of total number of cases considered, and of the 
inequality of the marginal grouping, by adopting the method proposed by Pearson and thinking on a 
correlation scale: see ‘‘On a Novel Method of regarding the Association of two Variates classed solely 
in Alternate Categories,” Drapers’ Company Research Memoirs, Biometric Series VIII, Dulau & Co. 1912. 
In that paper the values of the correlation on the probability scale rp were only tested against tetra- 
choric 7; for such totals and divisions as occur most frequently in everyday practice. A caution must be 
given here against the extension of that method without fuller investigation of such cases to divisions 
giving small percentages in the marginal frequencies. For such cases we know that tetrachoric 7, gives 
poor results. 
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But the cases of approach to discrete differences are exceedingly few in number 
compared to the mass of cases with which we have to deal; they exist as a rule 
only in the class-names, not in the things classified. Personally we have rarely 
found them except as already stated in theoretical Mendelian investigations, and 
to such cases the method of ranks, i.e. use of a ranks coefficient whether for four- 
fold or manifold tables, was first and rightly applied by Pearson. The Biometric 
School criticise not the application to Mendelian theory of these methods by Dr 
Brownlee and others, in which they only fellowed what had already been done, but 
Dr Brownlee’s applying tetrachoric 7, to such theoretical tables and then supposing 
that he had got at the root of the difference between theoretical and actual heredity 
correlation tables! Mr Yule nowhere distinguishes between a theoretical Mendelian 
table and what we can absolutely demonstrate not to be theoretical Mendelism, 
e.g. Pearson’s pigmentation tables. He writes: “As regards Dr Snow’s recent 
comments in Biometrika on the use of the normal coefficient for Mendelian 
tables in Dr Brownlee’s paper, he really thought that those comments were a 
much stronger condemnation of Professor Pearson’s than of Dr Brownlee’s work. 
Professor Pearson had repeatedly used the normal coefficient for inheritance 
tables, that were in all probability representations of Mendelian inheritance, as 
if it were an approximation to the product-sum correlation” (loc. cit. p. 651). 
It is a pity Mr Yule has not the courage of his opinions, and did not assert that 
the eye-colour data had a correlation of the Mendelian value } because they were 
Mendelian. Instead of that he throws the sop of 4 to the Mendelian Cerberus, 
having carefully produced it not from the proper fourfold Mendelian table, but by 
applying the method of pseudo-ranks, which involves just the same assumption 
of continuous variation and regression beyond the Mendelian divisions into unit 
characters, as is involved in the tetrachoric r,, What Mr Yule means by “in all 
probability representations of Mendelian inheritance” is eloquently demonstrated 
by our discussion of the case—eye-colour in father and son—which he has himself 
selected to illustrate the Mendelian }. 


If we leave the Mendelian theoretical table and any other truly discrete 
fourfold classifications, which are indeed difficult to find, and pass to the fourfold 
classification in general, what method are we to use? We assert that the four- 
fold or tetrachoric 7, is a coefficient of association infinitely superior to Mr Yule’s 


old Q or new oa. 


Mr Yule says he attaches no importance to the fluctuations of Q and @ for 
different divisions of the same table. If so, how can he usefully compare the 
values of Q or found from two tables with different divisions? If so, on 
what ground can he complain of the use of tetrachoric r, because in certain 
cases it fluctuates for different divisions of the same table? He admits that 
“the coefficients of association and of colligation for different divisions of the 
same table in many cases fluctuate more largely than the normal coefficient” 
(loc. cit. p. 651), yet the only illustration he gives in his paper is that of the 

33—2 
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table for eye-colour in brothers, for which he discusses the relative fluctuations 
of his coefficients and tetrachoric r, by a method which from the standpoint of 
scientific statistics is wholly inadequate. He states that “as soon as we leave the 
narrow field within which normal or ‘strained normal’ correlation holds good, the 
normal coefficient fluctuates as we change the axes of division quite as largely 
as any other coefficient” (loc. cit. p. 683). How does this “narrow field” tally 
with the “many cases” in which these other coefficients fluctuate more largely ? 
Mr Yule has purposely selected three or four markedly skew distributions to 
show how the tetrachoric 7, fluctuates; why did he not test adequately and 
completely its fluctuation as against those of the coefficient of association on 
this material? Would he ever have written his above dogmatic assertion had 
he done so? 


Now we do not for a moment agree with Mr Yule that there is no importance 
in the fluctuations of a coefficient of association for different divisions of the same 
surface. On the contrary we assert that a true coefficient of association should 
be as stable as possible, that is to say that for a given surface it should have 
fluctuations which are within a reasonable range indicated say by twice its probable 
error. Indeed, for most practical purposes fluctuations of ‘05 are of small im- 
portance. It is purely idle to do as Mr Yule has done, namely proceed to test 
such fluctuations by their ranges or their standard deviations obtained from the 
raw values. Each value must be considered in conjunction with its probable 
error. @=1 and r=‘70 are not comparable if the probable error of the first is 
zero and of the second ‘05, the weight of the first observation is infinite and of 
the second finite. Mr Yule has compared such results without any regard to their 
relative weights. 


Starting with tetrachoric r, we have a definite surface, the Gaussian, either in 
its original or strained condition, for which there are no fluctuations in 7;, except 
such as might arise from random sampling. No such surface has been discussed 
by Mr Yule for Q or ¢. But is an approximation to the Gaussian surface a 
rarity? Does it only occur in “a narrow field”? On the contrary it covers 
within the approximations required in practice a very wide field, namely nearly 
all distributions in anthropometry and many characters in plant, insect and 
animal forms. Even the irregularities of the eye-colour data may well turn out 
to be due to neglect of the age-corrections and not to failure of the Gaussian 
system. Can Mr Yule produce, even with careful seeking, actual distributions, 
in number one-tenth of the Gaussian cases just referred to, for which Q and 
therefore are practically constant for all divisions? There is no reasonable 
and logically consistent theory of deviations which leads to a surface of constant Q, 
although there is such a theory leading to a surface of constant 7;. We hold that 
stability of an association coefficient is not only a desirable, but an essential part 
of any true theory of association, and the fact that one theory does give a relatively 
stable coefficient for a large section of material is immensely in its favour. 
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But there are two other points about tetrachoric 7, which also in our opinion 
weigh much on its side when comparison is made with Q: 


(1) It is modified by every form of selection and thus corresponds to our 
experience of every true measure of relationship. If association is to be of profit, 
it should pass into correlation as our detailed knowledge of the material becomes 
greater. All Mr Yule’s association achieves is the passage with increasing 
knowledge into manifold and contradictory diversity. Every selection he has 
made on the basis of lesser knowledge must become an increased source of 
contradictory values, as he reaches more detailed knowledge of his material. 
There is a systematic variation of @; it rises continuously from median to 
extreme divisions in every distribution on which we have tested it. That is 
to say, there is a wide divergence of practical statistical data from the surface 
of constant @, and this in a definite direction. Nor is this to be wondered at, 
for the surface of constant @ has marked heteroscedasticity, and markedly curved 
regression: see Appendix III. 


(ii) <A true coefficient of association should not necessarily become perfect, 
when one of the four quadrants of the fourfold table becomes zero*. Whatever 
the degree of dependence of two variates may be, the frequency surface in practice 
is limited in extent, therefore by taking small percentages of one or both variates 
in the marginal totals we can always in practice make one quadrant zero. We 
hold indeed that a fitting coefficient of association need not necessarily be perfect, 
even if two opposite quadrants have zero frequencies ft. 


In both of these respects tetrachoric 7; is superior to the coefficients of 
association or colligation. The one aspect in which it is inferior is the labour 
of calculation, but the tables published by Everitt and an additional table which 
we hope shortly will be published render the labour by no means severe. 


In order to establish our position it remains to show how even in the ex- 
ceptional cases selected by Mr Yule—which do not represent the run of ordinary 
practice—the tetrachoric 7, is much more stable than his Q. 


* Consider the following Table for true 1='52: 


Father’s Stature. 


2 Under 71°5 | Over 71:5 Totals 
3 

S| Under 65°5 ... 1275 0 127°5 
m | Over 65°5 oe 863-5 87 950°5 | 
‘g —Ea 
iS) 

on) 


Totals... 991 | 87 1078 

What real knowledge do we gain by saying that association is perfect between ‘‘ tall’ sons and 
‘tall’? fathers, when it arises solely from the extreme position of the father-division not giving any 
content to a second quadrant on a limited surface of imperfect correlation ? 

+ See p. 177 and later Appendix I. 
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(A) 


Barometric Data. 


The following table gives the value of tetrachoric 7, and of the coefficient of 
association Q*. 


Southampton and Laudale. 


Divisions Tetrachoric r; etc ee Divisions Tetrachorie 7; Pees 
29°15 6968 + 0378 ‘9371 + 0144 29°95 "7991 + 0096 "8868 + ‘0075 
29°25 °7254 + :0291 9335 + 0124 30°05 "7954 + ‘0100 *8831 + ‘0075 
29°35 7476 + ‘0220 9244 + ‘0108 30°15 ‘7971 4 °0110 *8867 + (0075 
29°45 ‘7410 + 0215 ‘9067 + -0109 30°25 8112+ 0127 +9109 + ‘0070 
29°55 7303 + ‘OL77 8847 + O111 30°35 °7983 + °0149 9229 + ‘0076 
29°65 *7559 + °0140 8866 + ‘0097 BOYS 8300 + -0176 9579 + ‘0061 
29°75 *7848 + 0114 8951 + -0083 30°55 8452 + °0231 ‘9761 + 0050 
29°85 ‘T917 + 0102 8874 + ‘0079 — — —- 


* We have not thought it needful to deal in every case with both Mr Yule’s coefficients. Since w is 
a function of Q, and its probable error is a function of the probable error of Q, there is no occasion to 
do so, and it opens a whole field for the display of statistical fallacy. Consider two quantities 7 and p 
defined in their mutual relations by r=(1- ¢)/(1+¢) and p=(1—e!/")/(1+e!/”) ; then if we arrange a 
fourfold table so that r is positive, e will lie between 0 and 1 and p between 0 and 1. 


e=(L-r)/(L+7), "= (1—p)/(1+p), 
or taking logarithmic differentials we find 


Clearly 


i aL 
og /€=20,/(1—r?) and 7 Vel €= 20 pl (1 — p”) 
1 F ‘ 
o (On ay 
Thus it follows that Ly Oe ica a 
(jy nil 


Now if n be positive 7 is always greater than p; it follows accordingly that 1/p—p is greater than 
1/r—r, but if we have any given series we can choose the value of n so great that o,|p is less than 
olP 
7 


NA Y ei 12 
o,(7- For if'w=—~, then w=—-——,_ —— 
o>| nye € 


1 lin 1 
7m 1_ein 2n 


As n becomes large, then the limit of 


’ 


9 


5 pee 1- 
and therefore Limit of Dare | 

aIle 
which can be made as small as we please. 


Thus if we have any system of tetrachoric rs, it is always possible to devise a new system of p’s, 
obtainable by the relations 
e=(1-r)/((l+7) and p=(1-el™)/(1+el/), 
which have oolp less than o,/r. Thus, if we generalise Mr Yule, Q=(1- «!/”)/(1+«!/”) satisfies all his 
conditions of a coefficient of association, but we can always select n so that g,/2 shall be less than 
oQ/Q or ¢,/w, where Q and w are Mr Yule’s coefficients. Similarly we can deduce from the tetra- 
choric 7, a definite function of it p, which has a ratio o,/p less than o,/r, and yet satisfies all the 


so-called conditions of association. But for Mr Yule, and many others, the ratio of a quantity to 
its probable error is a measure of its significance. Hence by a proper choice of n we can always 


Kart PEARSON AND Davip HrEron 263 


It will be seen that the values we have obtained are not wholly in agreement 
even to the second place of figures with the values obtained by Mr Yule, who has 
selected only 8 out of the 15 diagonal cases. Diagrams XVI and XVII show that 


Diacram XVI. Correlation of Barometer at Laudale and Southampton. Actual values 
of tetrachovic r;. 
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Dracram XVII. Mr Yule’s representation of tetrachoric 7;. 
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deduce from one of Mr Yule’s type of coetticients a second which has greater or less or no significance 


as the case may be. Further : 
ere 


og. =-— = 0,=U0, 
Po nil-r? ee 


el/n (1+ e)? : : ae: 
where u=— — -__*_, but as 2 increases this tends to take the limit : 


1 (1+x«)2 


n 4k 


’ 


or can be made less than unity. In other words, given r or Q, we can always choose a new quantity p 
or Q, which has a smaller absolute probable error. If colligation for a given series has a smaller 
probable error than @ or a smaller one than 7;, it is always possible to choose new functions p or Q, 
which are absolutely determined by 17, or by Q and w, which have still smaller probable errors, and 
yet are true Yulean coefficients of association, ranging between 0 and 1! The fact is that the term 
‘‘ probable error” has no meaning for these coefficients until Mr Yule has discussed the nature of their 
frequency distribution, which is certainly not Gaussian. In the text we have compared 7; and Q with 
their probable errors because it probably gives some rough estimate of their relative stability, and it is 
the test Mr Yule has himself chosen, 
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Mr Yule’s selection has been singularly favourable to his unjustified assumption 
that the points lie along a straight line; actually the points are extremely steady 
between 29°85 and 30°35, and tail off to the terminals, where the percentages in the 
smallest quadrant are much smaller. However, apart from the question of linearity 
of the distribution of 7, there is little doubt that the values are in defect for low 
divisions and in excess for high divisions. The question therefore is, are these 
defects and excesses such as to invalidate the use of the method of tetrachoric 
functions applied even to variates as skew as the barometer data? The only method 
of answering this question appears to us to consider the relation of the values found 
to their probable errors, and again the amount of stress which is likely in practice to 
be laid on the result deduced from a single fourfold table. We believe that practi- 
cally it is not necessary for the type of reasoning based on a fourfold table that the 
correlation found should be nearer than +°05. In the present case three coefficients 
exceed these limits, but if we proceed solely to two figures the first and last only 
lie outside these limits. Diagram XVI shows this result by the lines AA’ and 
BB’. The hatched part of the diagram corresponds to the region on either side 
the product-moment value of 2°5 times the probable error. Actually some 10°/, 
of the observations should lie outside these limits, i.e. 14 observations instead 
of 3. But even these three have almost contact with the hatched area. It seems 
to us that in this very case of barometer data, which Mr Yule has chosen for 
its marked skewness to discredit tetrachoric r,, the coefficient defeats Mr Yule 
completely ! 


Now let us compare the results with Mr Yule’s own coefficient of association. 
The difficulty in the comparison lies with the standard value of Q against which 
the other Q’s are to be compared. Had Mr Yule studied the surface of constant Q 
—what we may term the association-surface—(see p. 184 above), then the Q corre- 
sponding to the best fitting association surface would have formed a standard Q. 
But we have at present no means of finding this standard Q, and Mr Yule tells us 
that he himself lays no stress on the diversity of values obtained for Q with different 
divisions. However, Mr Yule bas himself taken the mean Q' as a standard when 
he comes in a special case to deal with the relative stability of tetrachoric 7, and 
association Q. Accordingly we follow him in taking a mean Q. But he has gone 
astray in simply taking the arithmetical average of his Q’s or 7,’s without regard 
to their probable errors. The proper means to take are weighted means, and the 
proper standard deviations are weighted standard deviations. Each value must 
be weighted with the inverse square of its probable error as the measure of its 
grade of accuracy. With this weighting we find: 


Barometric Heights. 


Voto Weinhicd | 
Weighted Mean Standard Deviation oan 
| 
| 


Tetrachorie 7; ae “7886 0243 
Association @ Po “9194 ‘0346 
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It will be seen that the mean weighted tetrachoric 7, differs only by ‘0086 from 
the true product-moment value (‘7800). Further the variability of the association 
coefficient marks an increase of 42°4°/, on the variability of the tetrachoric 7;* ; 
there can be no doubt that in this first of Mr Yule’s selected markedly skew cases 
the tetrachoric coefficient is far more stable than the association coefficient. 
Another method of approaching the problem will also illustrate this point. Let 
us express the deviations from the means in the case of both coefficients in terms 


Diacram XVIII. Comparison of r, and Q for Correlation of Barometric Heights. @ has 
nine cases, 7, only one case outside 2°5 times the probable error. 


Barometer Heights. Southampton and Laudale. 
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Percentage in smallest quadrant. 


of their probable errors, and then average these deviations: the mean value of 
(Q -— mean Q)/(probable error of Q)=3'35, the mean value of (7— mean 71;)/ 
(probable error of 7,)=1°63. Clearly Q is 100°/, worse than 7. This is indicated 
on the accompanying Diagram XVIII, where the values of @ and 7; are plotted to 
the percentage frequency of the smallest quadrant of the table, a matter which 


* As both @ and r, have the same possible range of variation -1to +1, in such a case the standard- 
deviation and not the coefficient of variation properly measures the variability. 
Biometrika 1x 34 
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we have found a safe guide to determine how far we introduce risk in using 
tetrachoric 7; It will be seen that the unsatisfactory values of 7, occur when this 
smallest quadrantal frequency is less than 2°5°/,. On the side of Q or 7, towards 
their mean, a line is drawn equal to 2°5 times the probable error; in all but one 
case of 7, this reaches the mean; in nine cases of Q it does not, and the deviations 
in six of these cases are excessive. There can, we think, be no doubt that tetra- 
choric 7, is in this first of Mr Yule’s selected cases far more stable than his 
coefficient of association. 


(B) Skew Table of Number of Mendelian Couplets in Father and Son 
(see Table XVI on our p. 222). 
The following table gives tetrachoric r,; and Q with their probable errors for 
a second series selected by Mr Yule. 


Vertical division taken between 


= = 2-8 34 
esi 7="394:017 r= 874017 | 7,=-35+-030 | 7,=-33+-086 
Q='49#-018 Q="51 +023 = "63 + 048 Q="15 $153 
1—2 | »='374°017 r= 404-017 | 74='39-4025 7 ='B7 +066 
Q= 514-023 | Y=52+-019 (= "61 +029 (="72 £087 
Q—-3 = "354-030 = "394-025 | y= -41 +033 | y= 404073 
Q= "63 + 048 Q=614029 | Y=69+4-034 () = "79 £068 
| 
. | | 
3—} m= "3834086 | r='374:066 | 7,='40+-073 m= 43 +7122 
Q= 754153 | Y= 724-087 Q='79 $068 =90 + 072 


It is interesting to note the high values reached by the probable errors of 
tetrachoric 7 for the small quadrant frequency divisions. Both systems of 
probable* errors are based on the assumption that the total frequency is 4096. 


The following results were obtained : 


Weighted 


Weighted Mean Standard Deviation 


Tetrachoric 7, ie 3809 ‘0176 
Association Y ete 5627 0840 
| = : | 
Of course this table of discrete couplets is not one to which we should think 
of applying tetrachoric 7;,, but Mr Yule has applied it; however, the weighted 


* In using probable errors at all here for a basis of comparison, we assume that the table may be 
considered to have arisen from actual observations. 
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mean of such method when applied is only ‘048 in error, the contingency method 
giving °3288 and the true value being 3333. We see that, judged by weighted 
standard deviations, the relative variabilities are as 176 to 840, or the stability of 
the tetrachoric 1, is 48 times as great as that of Mr Yule’s coefficient. 


Diacrams XIX anp XX. No Q differs by less than its probable error from the weighted mean, 
only one 7; differs by more than its probable error. 
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We can look at this from the standpoint of the diagrammatic representation 
of the probable error (see Diagrams XIX and XX). In the lower figure we have 
the 16 points given by tetrachoric 7,; only a single one (shown by the individual 
linked with the non-black circle) is at a distance more than once the probable 
error from the weighted mean. In the upper figure not a single case occurs in 
which Q differs by so little as its probable error from the weighted mean value, 

34—2 
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There is indeed no comparison between the stability of tetrachoric 7, and Q in 
an illustration which Mr Yule himself has selected to indicate the badness of 
tetrachoric 7! Mr Yule’s coefficient of association, we are told by the statisticians 
of the Royal Statistical Society, will be used for Mendelian problems as soon as 
the Mendelians know a little algebra, but here in the very case of Mendelian 
data we find, as a mere coefficient of association, the 7, of the despised Gaussian 
theory is immensely superior to the Yulean coefficient*. Judged algebraically : 

Average value of (@ — mean Q)/(probable error of Q) = 2°34, 

Average value of (7,— mean 7,)/(probable error of 7,) = ‘57, 


or Q is four times as unstable as tetrachorie 7,. 


(C) Severity of Small-Pox and Strength of Vaccination Immunity. 


Sixteen cases can be worked out for tetrachoric 7; and Q for this material, and 
there is a great variation in both 7 and Qf. 


Haemorrhagic Confluent Abundant - Sparse and 

and Confluent and Abundant and Sparse Very Sparse 
At 10 r=tt? 7,=°3602+4 ‘0911 | ry-="2857+ °0614 | 7,=°2226 + 0626 
@=1+ ‘0000 =°7892+°1296 | Y=°5415+°1036 = ‘4319 + '1036 
At 25 = °2905 + 0598 | 7,= "3694+ °0281 | 7,=°3484+ 0249 | 7,='2500 + '0293 
Y= "571141079 | Y='5411+4'0413 | Y= 4469+ 0303 | Y='3444+ -0369 
At 45 r= °2187+ 0556 | 7,=°4169 + 0309 | 7,=°3961 +4 °0275 | 7,='2578 + 0332 
=°'4111+4°0890 | G='5714+ 0326 | @='5474+4°0351 | Y="4143 + 0569 
Between 45 and | 7,= °0426+4°0770 | 7,=°5121+4°0863 | r,=°6022+°0287 | 7,=°5381 + 0349 
Unvaccinated == OOO iia = *7220 + -0322 = 8599 + ‘0308 en "8862 + -0524 

| 


In the form in which Table XXII is given on our p. 254, all values of Q and 7 
are negative. 


This table gives us the following results : 


| Weighted | 


Weighted Mean Standard Deviation 


| — ~~ —— 


Tetrachoric 7, ss 3827 1211 
Association Q@ ... “5902 Bi | 


* Tt is needless to say we should never have thought of applying either of these coefficients to 
theoretical Mendelian data; we hold that the correct method was the method applied to this very 
case by one of us ab initio, namely that of product-moment r. 

+ There can be little doubt that the extreme variations are due to 9 out of the 16 divisions giving 
a quadrant of minimum frequency with less than 1°/) of frequency in it. 
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In forming these weighted means we have left out the value of @=1 + ‘0000 
and of =?+?*. In the first place had we included this value of Q, its weight 
would have been infinite, and whatever the value of 7; and its probable error may 
be, it follows that Q must be more unstable than 7. We have got rid of this 
anomalous Q, although Mr Yule’s theory gives it infinite weight, and dealt only 
with the remaining 15 cases; but to do this is like considering the assets of a 
bankrupt, after we have disregarded the claims of his principal creditor. The 
above table, however, shows that @ is, notwithstanding this disregard, much worse 
than 7;; in fact 43°4°/. must be added to the variability of 7; to reach that of Q. 


If we measure the deviations from the weighted means in terms of the probable 
errors as before we find for tetrachoric 7: the mean = 2°71 and for association 
Q: =331. Thus again, although since Q=1 is excluded, we find to a lesser 
extent tetrachoric 7, more stable than Q. 


(D) Lengths of Ivy Leaves. 


Mr Yule directly selected the lengths of growing ivy leaves as an especially 
skew distribution upon which to test tetrachoric 7;, and this in a case where the 
table had been deduced for homotyposis, i.e. with all the local lumpiness which 
arises from that method of treatment. If we take the 4 x 4-fold of our Table XVII 
on p. 222, nine values of @ and tetrachoric 7; are available. They are: 


=) Q—38 oy 
1—2 | ™4=6998+:0046 | 7,='6406+°0039 | 7,=°5572+ -0058 
)= 8531+ °0027 | GY=-8655+:0040 | Q==-9167 + 0085 


2—3 7, = 6406 + :0039 7,= 5731 + (0033 r,= 5218 + ‘0046 
= °8655 + 0040 (= 6768 + 0033 (= °7570 + (0048 
3-4 | m='5572+-0058 | 7,=°5218+-0046 | 7,= 55484-0052 
= 9167 + 0085 Q@=°7570 + :0048 (= "7508 + *0045 
We have the following results: 
Weighted | 


Weighted Mean Standard Deviation 


Tetrachoric 7; ae “5920 "0570 
Association Q vox “8024 0757 


* On the difficulty, in fact idleness, of attempting to determine 7, from tables with zero frequency in 
one quadrant: see Appendix I. That quadrant as we there show might have °5 in it, or, indeed, 
the material from which it is drawn unity. In the former case 20 terms in the 7,-equation will not 
suffice to determine the value of r;. In the latter case 7, might swing over from —1 to a small positive 
quantity. In fact the process of finding tetrachoric r, in such cases warns its user that it is indeter- 
minable. Users of Q will assert that the relationship is perfect with a zero probable error ! 
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Therefore Q exhibits an increase of about 33°/,'on the variability of tetrachoric 7;. 
If we take the probable error test 


— weighted mean 7; 


Average value of probable error 


() — weighted mean Q 
f § 


probable error = 108 


Average value o 


Thus @ represents an increase of 40°/, on the variability of 7. 


(EK) Correlation of Hair and Eye Colours. Livi’s Datat. 


The table is: 
Eye-Colour. 


: Blue | Grey | Brown Black Totals 
: | 
S| Blond... 9083 - | 816% je.) 70st. 1a. aay 24518 
S| Red ae 343... S518) i 819 37 1717 
2 | Brown... 17829 | 39467 | 117522 4945 179763 
A) Black 1, 3627 13433 | 54883 20919 92862 
an _———— 

Totals ... | 30882 | 61605 ya | 26118 298860 


This table gives the following nine comparative results: 


| Blue-Grey_ . Grey-Brown Brown-Black 
Blond-Red sits 74,= °53807 + °0026 74= 5074+ °0023* | 7,=°4392 + 0054* 
| = 7442 + -0023 QY='7129 + 0023 Q= 8422 + 0067 


Red-Brown .... | 7=°52394-0080 | 7,=°5255+-0023* | 7,= +4332 + -0048* 
Q="7356+ 0023 | W='7263+-0023 | @=:8294+ -0067 


Brown-Black ... 7,= °3601 + 0025 7, = 3240 + 0020 74= 6449 + 0020 
Q='5791 +0041 = 4393 + ‘0026 (= '8365 + ‘0016 


Now it is of interest to compare the values of 7, found by fourfold tables with 
those obtained by other methods. Mr Yule has nothing but the method of 
pseudo-ranks to apply to the table in its detailed form. This gives a Yulean 
3680; the value found by corrected mean square contingency is ‘5189; the 
weighted mean of the nine tetrachoric 7 values is ‘4842, much nearer to the 
contingency value. Indeed the Yulean, if corrected for ranks becomes °3953, and if 
corrected for class-index correlations becomes ‘5051, i.e. differs quite insignificantly 


| Antropometria militare, Part I. p. 62. 
* Mr Yule’s values do not agree to the second decimal place with ours in these cases. 
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from the contingency ‘5189*. It is thus easy to see why Mr Yule gets such 
an absurdly low value. If it be said that two out of the nine values of tetra- 
choric 7, are lower than the Yulean, the reply is a simple one: They are not 
divisions which would be made with our present knowledge of hair and eye 
pigmentation. There is no such thing as a “black” eye, at best it is only a dark 
shade of brown, and the division between brown and black is largely a matter of 
personal equation. Again the relative amount of pigmentation in a true grey eye 
may be as small as in many so-called blue eyes. The natural physiological division 
is between the blue-grey and the brown-black groups, although, if Livi put his 
hazels and greys with some pigment partly into blues and partly into browns, a 
division between blue and grey might give as good results. Passing now to the 
hair-colour, we believe the distinction between brown and black to be again a 
matter of personal equation, the shades of brown range up to black. Again red 
is a tint which may contain less pigment granules than the blond}, but in some 
varieties it has more granules than many browns. Hence the difficulty of such 
a division as that betwixt brown and red. ‘The real division would come between 
blond and brown with reds omitted or better still microscopically tested and 
placed in their appropriate division. Anyhow the divisions corresponding to the 
four left-hand top quadrants of the last table seem to us the most satisfactory, and 
they all agree in giving the correlation of hair and eye colours = circa *52, a result 
in excellent accord with the mean square contingency. 


Pearson’s data for British school children give by corrected contingency *52, 
and the following table shows that the values for all nations must be in excess of 
that determined by Mr Yule’s method : 


Correlation of Hair and Eye Colours by corrected 


Contingency. 
Italian Recruits ive oe Res ae ae 519 
British School Children ae is ie ae 524 
Baden Recruits as me ae ee Be "484 
German Jewish Children _ nae se Bee “444 
Prussian School Children... bes 2h ee “401 
Swedish Recruits oat a hs ae ae 414 
Italians by Yule’s Method of Pseudo-Ranks bes 368 


* Assuming normal correlation, the correction for ranks is r=2 sin (30° x °3680)=°3953. For hair 
groups the class-index correlation is ‘8686 and for eyes °9010, and variate correlation 


= °3953/(:8686 x ‘9010) =-5051. 


Of course these are only rough approximations, for Mr Yule’s theory of unit subranges precludes our 
using the corrections actually given. But any corrections will be roughly of this order. 

+ Some reds lack all granular pigmentation; there are blonds which also do this, but they are very 
few in number. Both reds and blonds without melanin pigment granules are truly albinotic hairs, 
whether accompanied or not by albinotic eyes. 
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Returning to the relative values of tetrachoric 7; and association Q we find: 


| a ee 
- ghted 
| Weighted Mean | Standard Deviation 


Tetrachoric 7; aoe hel “4894 “1102 
Association @ ... "7258 | ‘1161 


The variation is high for both coefficients, but even here tetrachoric 7, is better 
than @, you must raise the variability of r, 5°4°/, to reach the variability of Q. 
But the worth of the two coefficients is wholly unequal. If we are told the 
correlation of hair and eye colours is about '50, a whole series of ideas is associated 
with this number; a very little additional information gives the relative standard 
deviations of the two variates—accurately enough for practice—and we picture to 
ourselves the regression lines and the associated changes in pigmentation of hair 
and eye classes. But what does an association coefficient of ‘73 for special dicho- 
tomies tell us? We venture to assert that it conveys no information whatever 
to the investigator’s mind, and is absolutely incomparable with other association 
coefficients of the same table, because each for the same system depends on the 
values of the variates at which the divisions are made, and because it has not the 
least relation with any physical properties of the distribution. If Mr Yule replies 
to these criticisms, that tetrachoric 7, is also unstable, if not to the same extent, 
and that a function of Q does provide (if the table be doctored) a certain difference 
of percentages, we answer that tetrachoric 7, 1s far from so unstable for the distri- 
butions of ordinary practice as he has endeavoured to make it out by selecting: 
(1) pigmentation data, which as long ago as 1901 were recognised as irregular, and 
(ii) markedly skew frequencies*. The instability is many times compensated by 
the definite physical significance of the coefficient. 


Further the percentages which Mr Yule’s deduced coefficient of colligation 
represents are wholly artificial and incapable of any rational interpretation. If we 
were to equalise the number of vaccinated and unvaccinated in any locality, how 
could we equalise the number of deaths and the number of recoveries, and what 
intelligible meaning can be given to the percentages when it has been done ? 
How can we possibly give any interpretation to the result reached by a disciple 
of Mr Yule that the “index de corrélation’”—i.e. Mr Yule’s Q—between the 
stature of recruits and rent in the 20 districts of the city of Paris is “perfect” ? 
Average stature of recruits for different districts forms a continuous variate system, 
so does average rent, and the two properly correlated would show the nature of 
the regression line, but this disciple of Mr Yule’s, in order to save a little absolutely 


* Mr Yule (loc. cit. p. 624) speaks of the selection he has made as ‘‘ exhibiting moderately skew 
distributions.” Unless he means that they are not U- or J-shaped curves, we consider this an entire 
misnomer. 
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needful arithmetic, tells us that the index of correlation between stature and rent 
is perfect * ! 
(F) Lye-Colour Data of Pearson. 


We now come to the eye-colour data. These are the only cases we have so 
far worked out in which the variability of Q is at all comparable with that of 
tetrachoric 7;, and the reason is not very far to seek. We have already pointed 
out that the true difficulty in these* eye-colour tables appears to lie in lumps 
of excess frequency in or near the corner cells of the quadrants of less frequency 
and that thus the material is heterogeneous in character. We think it quite 
probable that this is due to the inclusion of really senile fathers on the one hand 
and of infant sons on the other, or of pairs of brothers or sisters one of whom is 
an infant. In this manner eye tints which are originally, or will become, mediocre 
in colour may be classified as very light. The record which Sir Francis Galton 
provided gave no ages; the original data are now in the possession of the Galton 
Laboratory and it is proposed to reconsider both these and the Huxley Lecture 
data for eye-colourt+, paying attention to change of eye-colour with age. This 
investigation will necessarily take a considerable time and it might be wiser 
to await its conclusion before entering further on this topic. But we should no 
doubt be told that we were omitting just those cases that appeared favourable 
to Mr Yule’s association coefficient, and accordingly we have included the eye- 
colour data here. The table on p. 274 gives the Brother-Brother coefficients for 
16 divisions. 

Now this is a symmetrical table and Mr Yule reckons the diagonal coefficients 
once, and the repeated coefficients twice, but we are doubtful of the accuracy of 
this process. Such a symmetrical table leads to exactly the same results from 
symmetrically placed divisions, and it is not clear why double weight should be 
given to such a division as 2—3 and 6—7 because as the table is written out 
it occurs twice. It seems to us that the diagonal values and all on one side of 
them are the only independent coefficients; we have lost the independence of the 
coefficients in the cells symmetrically situated with regard to the diagonal by 
the very process of adding the tables for First Brother in terms of Second Brother 


* A. Niceforo, ‘‘Contribution 4 l’étude des corrélations entre le bien-étre économique et quelques 
faites de la vie démographique,” Journal de la Société de Statistique de Paris, 52 Année (1911), 
pp. 322—341. Professor Niceforo studies ‘‘corrélations” by aid of the coefficient of association, 
which he applies to the following continuous variates: stature, rent, probable income, numbers of 
illiterates, of insanitary dwellings, of paupers, of pauper funerals, of workers, size of families, numbers 
of cubic feet of air space, of inhabitants to the acre, special death-rates from all sorts of diseases, 
general death-rates and birth-rates etc., ete. We believe that the whole of this work must be redone. 
Even if a rough estimate had been required, Mr Yule’s coefficient should not have been used, but the 
division made at the median values and Sheppard’s formula for tetrachoric 7, adopted. There is not 
even the excuse of apparent discreteness in any of Professor Niceforo’s attributes. 

+ Pairs of siblings in the same school are much more nearly of an age than any pair between the 
ages of 5 and 15 taken from the population of school children. While the head-measurements were 
corrected for age, the hair and eye colours were not, and probably the changes are more important than 
we believed at that date. 


Biometrika 1x 
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First Brother. 


Second Brother. 


2-3 =) 45 67 
2-3 | m='5058+°0172 | ™='3761+°0184 | 7r,=2428+-0212 | 7,=-2373+4-0240 
(= "6214+ :0176 =:4920+°0230 | Q=-3434+-0304 | @=-3653+ -0383 
3—4 | 4='37614-0184 | 74=°5185+°0160 | 7,=°3524+-0196 | 7,=-2777-+-0230 
= -4920 + 0230 = 6229 + 0165 = 4603+ 0232 | @='3963+-0301 
4—5 | m='24284-0212 | 7,='35244-0196 | 7,=°36774°0209 | 7,=-2978+ -0246 
| ) = "3434 + 0304 = 4603 + 0232 = "5089+ +0231 | @=-4306+ -0301 
6—Y | 4=:23734-0240 | 7,=:27774°0230 | 7,='29784-0246 | 7,='3128+-0276 
(= 3653 + 0383 ) = 3963 + 0301 Q=-43064°0301 | @=-4705 + -0326 


and Second Brother in terms of First, and the repetition of the same numbers in 
a second cell does not give those numbers double weight. Had we worked our 
tables for Elder Brother and Younger Brother, each cell would have had 
independent weight, but adding them reversed we have lost one-half of the 
non-diagonal independent frequencies, and we must not still retain the same 
number of independent weights. Relative to Mr Yule’s method of procedure, this, 
in our opinion, true method of weighting emphasises in symmetrical tables the 
diagonal columns and would correspondingly better tetrachoric 7, as against Q. 
But we have not used what we consider the true weighting, because it might be 
said that it had been adopted with a view to bettering our position. Arranging as 
before we have the following results for the 16 coefficients : 


Weighted 


| | Weighted Mean  Seandarae Deviation 


Tetrachoric 7; ree | 3474 0931 
Association @ aa | 5083 0952 


Thus the positions of r, and Q are just reversed by proper weighting*. The 
coefficient of variation has no meaning, we hold, in the case of mere numerics like 
r, and Q, both of which may range from —1 to +1 in the general case. Indeed 
the case is more complex than can be accurately determined even by weighted 
standard deviations. For Q, although nominally ranging between +1, is for any 
given case numerically greater than the corresponding tetrachoric 7,, and thus 
their variabilities if not functionally related are related by limitations. For 
reasons already given we see no advantage in considering the probable error of 

* Mr Yule (loc, cit. p. 634) gives them as ‘084 and ‘081. If we take into account all the reasonably 
possible divisions 36 in number, i.e. 1-2, 2-3, 3-4, 4-5, 6-7, 7-8, the standard deviations are respectively 
o,='121 and TQ= ‘130, while the ranges, on which Mr Yule appears to lay stress, take for 7, the value: 
‘59 and for Q@: the value -92! 
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colligation (it is quite easy to deduce another “coefficient of association” from 
Q, which will have a still less probable error than , or one from tetrachoric 7; 
having a less value than both: see p. 262 ftn.). Further we do not consider that the 
product-moment method (i.e. ¢ which Mr Yule erroneously terms the correlation 
coefficient) has any application to these eye-colour data; it is purely idle to deal 
with the difference between a ‘light brown’ and ‘dark brown’ eye as a discrete 
* unit.’ 

We next turn to the Father and Son Eye-Colour data. The table for 
16 divisions is: 


For Father. 
| gaee | 8—4 4—5 | 6—7 
| . 
2-8 r= 504 + 029 r= '405 +°031 m= '385+°0385 | 7,='316-+°041 
= 616 + °030 = 528 + 038 = '549 + 050 = "487 + 064 
S| 8-4 r= 391 +°031 7, = "550 + 027 = 493 + 032 r= 421 + °038 
RN = '500 + 038 = 658 + 027 )= 632 + 034 = "579+ ‘045 
5 
ics 
5 74= 276 + 035 7,= ‘466 + 031 74='575 + 032 7,=°517 + 037 
= "381+ 049 ='590 + ‘034 ='716 + ‘028 Q= "684 + 035 
GH 7,= ‘266 + °040 7,= ‘374+ 038 r= °457 + 040 r= "512 + 039 
= "402 + 062 = "519+ 047 = 622 + 040 @= 695 + 037 


In this case we find: 


Weighted 
Standard Deviation 


| Weighted Mean 


Tetrachoric 7; ae "443 086 
Association Q 605 086 


If we thought range was a measure of variation we should have: 
Range of tetrachoric 7; = ‘31. 
Range of association Q = ‘34. 


It is clear that for these eye-colour tables Q is almost as stable as tetra- 
choric 7,. Indeed judged by the probable error test Q is slightly better than 
tetrachoric 7;, for we have 


«- 
Mean value of Brothers and Brothers | Fathers and Sons | 
(74-7)/(p.e. of ey) eee 3°5 2°2 | 
(Q- Q)/(p.e. of Q) ... 3°2 1:9 


35—2 
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The weighted standard deviation test, which is probably a better one, shows 
that 7, is superior to Q for the Brothers table and equal to it for the Father and 
Son table; both are terribly bad and a slight change in the nature of the test 
may make one or other apparently slightly superior. In this eye-colour material 
in its present state we are quite clear that the tetrachoric method applied 
to extremely skew divisions will not give consistent results. But this had been 
stated years ago, in passages which Mr Yule refrains from quoting. We are 
equally clear that Q gives no better results, while for the bulk of tables of 
statistical practice, it certainly gives less stable results than 7. 


(G) Age of Husband and Wife Data. 


This is a case of extreme skewness* selected by Mr Yule to show how tetra- 
choric 7, varies. It is also a case of heterogeneity, for second marriages are mixed 
up with first marriages. With his usual ingenuity Mr Yule has exhibited a curve 
and a table, in which he has increased the number of the terminal values, where 
tetrachoric 7; gives too low values and has little weight, and shown few of the 
central values where 7, is relatively steady and has great weight. Further, he 
tells us that: 


“From the standpoint of the calculator, however, the table presents the 
disadvantage that the correlation is high, viz. ‘91, and the approximation to the 
value of the normal coefficient correspondingly slow, eight to ten or twelve terms 
of the equation being necessary to give a value fairly trustworthy in the second 
place of decimals” (loc. cit. p. 624). Mr Yule may have got results “ fairly trust- 
worthy in the second place of decimals,” but as either the odd or even series 
of powers of 7, in the equation rarely becomes convergent till much beyond the 
twelfth term—we have had in some cases to go to 18 or 20 terms—the confidence 
Mr Yule put in his values was quite unjustified. The case is an interesting one 
because the true correlation is very high, i.e. ‘9253 + 00004}. Accordingly, 
Mr Yule’s association coefficient whatever division is taken is constrained to lie 
between something like ‘952 and 1. It is therefore difficult to compare the range 
of instability of @ with that of tetrachoric 7, We are bound to consider both © 
in relation to their possible ranges. 


That the curve given for tetrachoric 7, indicates no defect in 7; relatively 
to other coefficients, will be at once appreciated by comparing it with the curve 
for Mr Yule’s coefficient of colligation w! That coefficient varies just as much, only 
it is the other way round: see Diagram XXI, p. 278. As for the Boas-Yulean 
coefficient @, we could only assume from it, that there was, when the dichotomies 
were at young or old ages, a low relation between the ages of Husband and Wife. 

* The actual skewnesses are: for Husbands Sk.='71 and for Wives Sk.=:76. These are among 
the highest values on record for skewness, and the surface is not ‘‘ moderately skew”? as Mr Yule 
without publishing any numbers (loc. cit. p. 624) states it to be. 

+ The regression line is sensibly curved at the terminals, but this does not markedly influence 7 


which uncorrected is ‘9142, as against r=‘9136 without Sheppard. With Sheppard 7 rises to °9253. 
Mr Yule gives 91 for this correlation, which appears to us an uncorrected value. 
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Yet according to Mr Yule @ is “applicable in its entirety to the 2 x 2-fold 
table.” Of course we may “apply” any method to any problem, but whether 
we shall obtain anything of value from the application is another question, 
and Pearson’s original statement about Boas’ coefficient that “it differs in the 
simplest cases from the true coefficient of correlation, and often differs con- 
siderably...and its use is liable to be misleading, especially if compared with 
values of the true coefficient found by other processes*” was amply justified and is 
well illustrated by this case. The tetrachoric 7, values cluster round the true 
value, the Boas-Yulean never reaches it. In the following table the values of 
the tetrachoric r,, of the coefficient of association Q, of the coefficient of colligation 
w, and of the Boas-Yulean, Pearson’s ¢, are recorded with their probable errors for 
all divisions at like ages, since these are the divisions selected by Mr Yule. 


Ages of Husband and Wife. Values of the various Coefficients 
proposed to measure Association. 


a sae "t Q id | we 
18 2 ‘9991 + ‘00018 "9587 + ‘0004 0711 + :0097 
19 2 “9965 + 00020 ‘9197 + 0022 “1240 + 0042 
20 *7755 + 0025 “9932 + :00016 *8895 + 0012 *2128 + 0024 
25 *8813 + :00380 ‘9734 + ‘00010 °7919 + :0003 5644 + -0005 
30 9302+ °0001 | °9745+ 00006 *7958 + ‘0002 °7137 + 0002 
35 ‘9522 + ‘0001 *9795 + ‘00005 *8151 + 0002 °7735 + 0002 
40 *9535 + ‘0001 *9821 + ‘00004 *8265 + ‘0002 *7948 + ‘OU02 
45 *9630 + ‘0001 “9846 + ‘00004 “8381 + 0002 *8003 + ‘0002 
50 ‘9601+ -°0001 | +9850+ -00004 *8401 + ‘0002 *7867 + 0002 
55 *9570 + ‘0001 "9863 + ‘00004 *8468 + ‘0002 *7658 + °0002 
60 °9471 + ‘0001 "9864 + ‘00005 *8471 + ‘0002 *7260 + °0003 
65 *9350 + ‘0002 *9881 + °00005 *8565 + °00038 °6733 + 0005 
70 *9159 + ‘0003 *9897 + ‘00006 *8656 + °00038 *5947 + 0007 
75 °8917 + 0006 *9922 + ‘00006 *8823 + ‘0005 *4932 + ‘0012 
80 *8450 + :0020 *9946 + :00008 *9013 + ‘0007 "3512 + ‘0024 
85 *8081 + ‘0024 *9975 + 00010 9317 + -0013 2046 + ‘0046 


The first two divisions were not used because the work of calculating tetra- 
choric tables to the large number of terms in 7; required for getting results even 
approximately correct is excessive when 1, is large and the dichotomies extreme. 
It will be seen that we very frequently differ by a unit in the second figure of 
r, from Mr Yule’s results and that our diagram (Diagram XX1) differs from his 
for r, by being sensibly nearer to the true correlation. He has stopped his series 
before it began to converge properly. 


The following gives the percentage in the least quadrant at each age division : 


0003 |°005) 031 | -962 | 1°92 | 2°13 | 2°12 | 1°88 | 1°69 | 1°35 | 1°10 | °734 | 444 | -216 eat 


* Science, Vol. xxx. p. 24. 
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Diagram showing variations of various coefficients for fourfold tables with diagonal 
dichotomies in the case of Ages at Marriage of Husband and Wife. 


Diacram XXI, 
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The reader may ask what are our actual grounds for neglecting the results 
obtained from extreme divisions in the case of tetrachoric r,? The answer is that 
it is not only experience of how slight skewness at extreme divisions produces large 
changes in the value of 7,, but our knowledge that the weights of these outlying 
divisions are in the case of 7, (to a lesser extent in the case of $) insignificant 
as compared with the weights of the central divisions. No such grave differences 
exist in the case of Q or (to a slightly lesser extent) in the case of w. The 
following table gives the weights, treating the weights of the division at 20 
as unity. 


Division r | Q 5 OB | 
20 1 1 1 1 | 
25 69 2°7 12°5 24°5 | 
30 370 61 Q7°4 97°9 | 
35 772 10°9 36°2 171°9 | 
40 977 14:3 40'1 187°7 
45 1276 16:7 44:7 153°1 
50 977 15:8 40°1 145°6 
55 625 15:0 31°4 97°1 
60 319 11°9 25:2 554. 
65 142 | 10:0 18°5 22°6 
"On. | 54 | 7-7 12°5 10°6 
5 | 16 5:9 6:9 3°8 
80 15 3°7 3-0 11 
85 | 11 | 2°5 0:9 0:3 


It will be clear: first that it is idle to measure the variations of these quantities 
from anything but a weighted mean, and secondly that no one would after seeing 
such weights dream of determining tetrachoric r, from extreme divisions. If we 
omit the first two and the last two values of 7,, noting their sight weight, then 
the remaining values in no case differ by ‘04 from the true correlation and the 
mean divergence is 015 only. Mr Yule’s Q must be steadier here than 7,, because 
its range is limited by the nature of the case to about ‘05, while there is no limit 
to the range of the latter. 


(H) Association in a typical table of Ordinary Statistical Practice. 


Thus far we have dealt with the relative stability of the coefficient of 
association and tetrachoric 7, on material especially selected by Mr Yule to 
exhibit the variable character of tetrachoric 7,. But in concluding this branch 
of our discussion we should like to exhibit the relationship of the two coefficients 
for such surfaces as occur in ordinary statistical practice. For this purpose we 
will make no selection ourselves, but illustrate the matter on the correlation table 
chosen by Mr Yule himself in his first memoir on Association*. We take the 
table as it is given without any knowledge of its degree of approach to the 
Gaussian or of the material with which it deals. Its coefficient of correlation by 
the product-moment method = ‘677. Now we commenced by leaving out all the 


* See Phil. Trans. Vol. 194 A, p. 277. 
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border cases of the table which give Q=+1 with infinite weight. Also the 
cases adjacent to these where Q, depending upon very small frequency in one 
quadrant, has a very high value, a low probable error, and accordingly a very 
great weight. With the exception of this we took every alternate case in every 
row and so obtained 41 coefficients; the corresponding tetrachoric r,s were like- 
wise calculated. The results are exhibited in the accompanying table, the 
notation being that adopted by Mr Yule to mark the divisions. 


There is practical certainty that any extension of the boundaries of this system 
of divisions could only better the position of 7, relative to Y. We obtained the 
following results : 


Weighted Mean Q= 834. Weighted Mean 1, = ‘672. 
Weighted s.p. of Q = 0613. Weighted s.D. of 7, = 0478. 


We must therefore increase the variability of tetrachoric 7, by 28 °/, to reach 
the variability of Q. This shows the real degree of difference in the stabilities 
of r, and Q for the correlation-tables of ordinary practice. We note also that the 
weighted mean of 7, differs by only ‘005 from the true product-moment value, 
‘677, of the correlation. 


Now let us approach the matter from the standpoint of probable error. We 
have : 

Mean (r, — ,)/probable error = 1:06. 
Mean (Q — Q)/probable error = 1°72. 

In the case of 7,—#, we have 21 below their probable error in value and 20 
above it, just what there should be. Only 5 exceed twice the probable error and 
there should be 4. In other words the distribution of 7, in terms of its probable 
error might well have arisen from random sampling of Gaussian material. Now 
turn to Y: in terms of the probable error Q shows an increase of upwards of 
62 °/, on the value for 7,. There are only 15 values of Q—Q below this probable 
error, 26 in excess of it. There are 15 values instead of 4 in excess of twice 
the probable error. There are five values in excess of three times their probable 
error, compared with only two occurring in the case of 7,. It is, we think, obvious 
that the variations in the case of @ are far greater than those due to random 
sampling*. Our Diagrams XXII and XXIII (pp. 282-3) indicate two points. 
In the upper parts of these diagrams we have plotted 7, and @ to the percentage of 
frequency in the quadrant of least frequency. We see at once that if we avoid 
quadrantal frequencies under 1 “/, the value of tetrachoric 1, is for practical purposes 
equal to the true product-moment 7. The reader will recognise the far greater 
scatter of Q. In the lower figures we have plotted 7, and Q with relation to their 
probable errors; the full dot denotes the observation, the open dot the end of a line 


* The reader must remember that there is no reason to assert that the errors must be of the order of 
random sampling; there is only one table, not many random samplings from a much larger mass, and 
we take different divisions. But we assert that if the errors be of that order, then the method is as good 
as the data warrant our using. 
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Values of Q. 
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equal to twice the probable error; where this open dot is seen, there twice the 
probable error fails to reach the true value of 7, or the mean value of Q as the 
case may be. The failures of Q are seen to be three times as many as those of 7,. 


Diagram XXII, Values of association @ for a typical table of ordinary statistical practice. 


on enlarged horizontal scale, 
with twice their probable errors. 
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We may conclude justifiably that even asa coefficient of association 7, is much 
more stable than @ for the usual type of distribution ; and we have already seen 
that this is so even for many marked cases of skewness. It is always an approxi- 
mation and often a very good approximation to the true correlation. On the 
other hand Q conveys no idea to the mind at all, except in as far as it is an 
approximation, usually very bad, to the same correlation. 


Calculated values of 7, (fourfold tables). 
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(14) General Protest against the Use of Mr Yule’s Coefficient of Association. 


It may be said that a vigorous protest against Mr Yule’s coefficients 1s 
unnecessary. We believe on the contrary that, if not made now and made 
strongly, there will be a great set-back to both modern statistical theory and 


Dracram XXIII. Values of r, for a typical table of ordinary statistical practice. 
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modern statistical practice. The publication of Mr Yule’s text-book has resusci- 
tated the use of his coefficient of association; it is now being used in all sorts of 
quarters for all sorts of unsuitable data. The publication of Mr Yule’s recent 
paper on Association will also lead to the use of his method of pseudo-ranks. The 
36—2 


errors). 
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coefticients of association and colligation are in our opinion wholly fallacious, they 
represent no true properties of the actual distributions, and they have no adequate 
physical interpretation. The coefficient obtained by the method of pseudo-ranks 
is equally fallacious, unless the variables proceed by and have been grouped by 
discrete units. But both Mr Yule’s methods are so easy of application, that those 
who will not devote the small amount of time and energy requisite to the dis- 
cussion of data by more adequate processes at once adopt them without further 
consideration. Thus Mr Yule’s coefficient of association is passing into French 
statistical literature as [indice de corrélation, a term originally introduced by 
Galton for the coefficient of correlation and now transferred to the different and we 
hold fallacious Yulean measure of association. Professor Niceforo writes *: “ Mais 
pour nos études sur la corrélation entre les différents phénoménes économiques, 
démographiques et autres, dans les quartiers et les arrondissements des grandes 
villes, ot nos trouvions en présence de séries formées par un nombre plutét re- 
streint d’éléments (80, 25, 20) nous avons préféré nous servir de la méthode Yule, 
plus rapide et donnant, quoique moins précise que la méthode précédente, des 
résultats tres satisfaisants.” The preceding method is the method of the product- 
moment which Professor Niceforo discards for Mr Yule’s association coefficient, 
using it solely—and apparently with Mr Yule’s approval (see loc. cit. p. 324)— 
for absolutely continuous variates, where the coefficient of correlation could be at 
once found and a graph easily drawn of the regression line. Professor Niceforo 
speaks everywhere in his paper of the correlation being this or that, and entitles 
his paper “Contribution a l'étude des corrélations entre le bien-étre économique 
et quelques faits de la vie démographique.” What his or Mr Yule’s test of 
“résultats tres satisfaisants” may be we do not know, but we consider that the 
whole of Professor Niceforo’s work will have to be repeated before anything 
can be learnt from his data. 


We have worked out two illustrative cases from Professor Niceforo’s material 
to indicate what we consider the extreme danger of Mr Yule’s methods. In the 
first case, that of the average stature of conscripts and the average rent in the 20 
arrondissements of Paris, the answer Professor Niceforo gives is Q=1+°0. The 
correlation we are told is perfect. In the second case, that of the correlation 
between the mortality and natality of the same arrondissements, we are told that 
the “corrélation...est tres forte.’ Professor Niceforo gives it as: 


Indice de corrélation R = 0:977 + :032,450 (sic !). 


We had failed to give any real interpretation to either of these results, and we 
turned to the original data. These we found hard to discover, because Professor 
Niceforo does not refer at each stage to the exact source of his original material. 
Manouvier gives the mean stature of the 20 arrondissements in 1880 and 1881+. 
These appear to be what is given as Statwra media in Professor Niceforo’s book 


* Journal de la Société de Statistique de Paris, 52 Année, pp. 322—341; see p. 324 and elsewhere. 
+ Bulletins de la Société @ Anthropologie de Paris, Année 1888, p. 161. 
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Forza e Ricchezza, Turin, 1906, p. 15. For the rents we have not been able to 
verify Professor Niceforo’s returns, or to discover whether they are for approxi- 
mately the same years as the statures. In the Annuaire Statistique de la Ville de 
Paris, 1901, no similar details as to rent appear to be given, and Professor Niceforo 
gives no reference to the year for his rent data. Taking, however, the data given 
in his book we find: 

Mean Stature = 1645-75. Mean Rent* — Ors: 

Standard Deviation = 6°212. Standard Deviation = 161°43. 

Correlation of Stature and Rent = °7825 + :0591. 
Regression line of Stature S on Rent R: 
S = 1639-4 + 03011 R. 
This is represented on Diagram XXIV and we see at once that we have a quite 

intelligible relation between average rent in the arrondissement and average 
stature of conscripts. The correlation is high, but far from “ perfect,” and is subject 


Diagram XXIV. Association of Stature and Rent in Parisian Arrondissements. 
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to a large probable error, so that the true relation might be easily anything between 
‘66 and ‘90. If, however, we divide at the mean values and add up all the dots as 
; Les 
Professor Niceforo appears to have done, we have the fourfold table: 1 oe 
giving association @=1+°0. What is the value of this, what does it signify ? 
We fail to extract the least idea of the real relationship—as represented by the 
dots on the graph—from such a statement. Verbally it means simply that no 
rent of over 210'8 is associated with a stature under 1645°75, but to assert that 


* We are unable to say in what units or for what periods rent is measured. 
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this signifies that “la corrélation est parfaite” is totally misleading. Professor 
Niceforo speaks of the correlation between stature of conscripts and the rent 
of their arrondissements of origin; he does not say that he is merely dealing with 
the practically unimportant fact that no arrondissement with a mean rent over 
211 has given a mean stature under 1646. But even to state this simple fact 
would be more enlightening than to talk of the index of correlation equalling 
unity. 

The second case we took was that of the mortality and natality of the same 
20 arrondissements. Again we had great difficulty in tracing the original source 
of the information. 

However in the Annuaire Statistique de la Ville de Paris, Année 1904, we find, 
p. 125, the natality per year based upon 1000 women of ages 15—49, and, on 
p. 135, the annual mortality based on 1000 inhabitants for these arrondissements. 
The natality is for the period 1886—1895, the mortality is given for the periods 
1886—1890 and 1891—1895 separately but not combined. As Professor Niceforo 
does not give a reference to the years dealt with, nor the source of this Parisian 
data, we have taken the simple mean of the mortality for the two periods and 
correlated this with the natality for 1886—1895. The constants found are: 

Mean Mortality =21-73: Mean Natality = 79-065. 
Standard Deviation = £9215. Standard Deviation = 23°7298. 
Correlation = r = ‘9168 + ‘0242. 
Regression Line of Natality V on Mortality J: 
N = 44181 M — 16°94, 


Diagram X XV indicates the position of the observations and their relation to 
the regression line. It conveys as adequate a representation of the whole relation- 
ship as it is possible to give on the data. But if we count up on the graph the 
dots in the quadrants obtained by drawing the lines at the means we obtain the 
0/8 
a eal 
the real nature of the correlation is given by such a result ? 


fourfold division leading again to Q=1+°0*. What information as to 


If Professor Niceforo had desired to obtain a rapid approximate value to the 
true correlation in these cases, he should have drawn his divisions at the medians, 
not the means, and used Sheppard’s formula 


Y = COS 1F lie for ea! 
7 a+b bla 


to find tetrachoric r,. In this case his results would have been the tables 
8 | 2 9|1 


318 and T[! 
r, = 81 +05, for rent and stature, 
r,=°95 + 04, for natality and mortality. 


* Professor Niceforo gives ‘977 probably from slightly different data. 
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Both results correspond within the limits of the probable errors with the actual 
correlations "78 and ‘92. These results are far more valid than the association 
results, but of course have not the value of a graph showing the regression line. 
Now here is a case of a man proposing to deal with a most interesting problem, 
for which quite serviceable data exist, led at once from the track of sound treat- 
ment by the application of this fallacious doctrine of association! And this is far 
from a solitary instance of the harm Mr Yule has done by the publication without 
adequate warning or guidance to his readers of the section of his text-book 
treating of association. 


Dracram XXY. Association of Natality and Mortality in Parisian Arrondissements. 
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15) Further Criticisms of Mr Yule’s Methods of Controversy. 
) ) y 


To certain minor points of Mr Yule’s memoir reply will be made in the present 
section. 


(a) Partial Correlations formed from the Normal Coefficient (loc. cit. p. 627). 
Mr Yule is deliberately confusing two different ideas, the correlation of A and B, 
two continuous variables, for a constant value of a third variable C, with the 
correlation of A and B for a given range of values summed under a certain class- 
index or group of class-indices of the variable C. The former is the only sense in 
which the term partial correlation has been hitherto used, and there is no reason 
why Mr Yule should deliberately confuse this sense with a wholly different con- 
ception, that of the correlation of A and B for a sub-universe of CL The whole 
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theory of the correlation of sub-universes has been dealt with by Pearson* and the 
formulae obtained in 190i were known shortly afterwards to be perfectly general, 
although the proof of this generality was only recently published +. 


Let s, be the standard deviation of the sub-universe selected, o, its standard 
deviation before selection. Let the three characters be represented by the sub- 
scripts 1, 2,3; then if we write 

Pr =e V1 — e/a, pias is V1 — alga 


we have for the correlation within the sub-universe 


_ _ 123 7 PiePis i 
1¥23 = - ; 
VI ar Piz 2/1 c= P13" 
Tog — Tyo? 
and i a 


re Vien 


for the partial correlation coefticient. 


In the case of normal correlation to which Mr Yule is referring, s, is the 
standard deviation of the truncated portion of a normal curve. For his special 
cases when that portion is one-half the frequency curve 


9 
and ey o,. Hence 
TLE 


at a 2 Tai eee 


But it is equally feasible to get almost in a line the value of ,¥. for any 
truncated portion of the normal curve other than one-half, and tables for 
determining the values of ,. and ,4,, the moments of the tail about the severance 
ordinate at #, giving 

sv=o/? (flo = ofA), 


were calculated by Dr Alice Lee and published in 1908§. These functions were 
termed the incomplete normal moment functions. Had Mr Yule paid attention to 
any of this work, he would hardly have published his special illustration and 
remarked “ At present I have not been able to carry the matter further” (loc. cit. 
p- 628). The general formula had been given eleven years ago, and tables from 
which it was quite easy to calculate special cases were published four years ago! 


* Phil, Trans. Vol. 200 A, pp. 1—66. 
+ Biometrika, Vol. vit. p. 437. 

+ Phil. Trans. loc, cit. Hqn. (lvi), p. 25. 
§ Biometrika, Vol. v1. p. 66. 
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Pearson did not give a name to his coefficient ,1,; above, but he carefully 
distinguished it from the partial coefficient ,r,; and stated that the former 
generalised form which did not select at a given value but round it was more 
important for both natural and artificial selection (loc. cit. p. 31). Mr Yule has 
apparently just awoke to the importance of ,Y.3,, but that is no reason why he 
should confuse it in the minds of his readers with ,7,,, or lead his readers to believe 
that we do not know the difference between the two. ‘To avoid confusion of this 
kind in future we shall henceforth speak of ,.; as a singular partial correlation and 
iY.; as a plural partial correlation. For, the former expresses the relation between 
A and B for a single value of C and the latter for a plurality or universe of values 
of C. In actual practice there is little difficulty in determining ,r,; if there is 
enough material, for all we have to do is to take the given universe out of C and 
correlate the resulting A’s and B’s. On the other hand, when we speak of the 
relation of health of child to health of mother for constant employment of mother 
or constant habits of mother, we do not look upon the universe of employed 
mothers as a whole or the universe of mothers with bad habits as a whole. We 
are thinking of employment of mother as a graduated character and parental habit 
also as a graduated character, and we properly use ,7.,; to measure the relationship 
of health in mother and child for a constant grade of employment or constant 
grade of bad habit in the mother. In this case the use of ,7,; has precisely the 
same justification, if 7,3, 7; and 7, are found by tetrachoric tables, as if they had 
been found by product-moments, provided the assumption of a Gaussian distribution 
be reasonably justified for the material in question. There is no other source of 
error in the use of 7; as Mr Yule obscurely seems to indicate. It did not need 
Mr Yule’s numerical illustration (loc. cit. p. 629) to prove that ,¥,; for the two 
sections of an unequally divided normal curve— defectives’ and ‘undefectives’ 
(sic !)—is in neither case equal to ,r.,. The two coefficients have different values 
and different significance whether the frequency be Gaussian or non-Gaussian. 


(b) Mr Yule’s Failure to distinguish between Criticism of Method and 


Criticism of Conclusion. 


We have seen in the course of this paper that Mr Yule’s coefficient of 
association automatically rises in all cases examined when our dichotomy is very 
one-sided. This is very obvious even in skew distributions; compare Diagrams 
XVIII, XIX, XXI and XXII where the rapid increase of @ for small per- 
centages is obvious. Heron working on the Gaussian surface had demonstrated 
that this was an absolute necessity which flowed from the theory and that there- 
fore it must follow, even for surfaces only approximately Gaussian, that two or 
more values of ( were quite incomparable if the dichotomic lines were at 
different percentages of marginal frequencies. He argued that no valid proof 
could therefore be based on the relative sizes of @ in a series of tables for which 
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we knew nothing about the frequency and for which the dichotomic lines gave 
marginal frequencies having any continuously ascending order. 


Such invalid discussions as to the “apparent law that associations were on 
the whole higher where populations were healthier or more defective” and on 
the relation of change of association between two defects with age had actually 
been attempted by Mr Yule*. Heron criticised, it will be seen, not the laws, if 
they be laws, but Mr Yule’s attempt to investigate them by “association.” He 
wrote}: “For precisely similar reasons his discussion of the change of association 
with age must be dismissed as entirely fallacious. There may be, and probably is, 
some decrease of association with advancing age, but the enunciation of such a 
law on the basis of a number of coefficients of association is purely idle.” Heron 
then proceeded to show that the proportions of blind and of mentally deranged 
both increased steadily from infancy to age, and this signifies that if the frequency 
surfaces were of the same type and really had the same correlation at each age 
the @ would steadily decrease with age. It will be seen that Heron’s criticism 
applied solely to Mr Yule’s methods; it was an unanswered and, we believe, 
absolutely unanswerable criticism of the absurdity of trying to deduce laws from Q. 
How does Mr Yule meet it? He writest: “Dr Heron also objects to my con- 
clusion that association decreases with age. His objection appears to be that the 
product sum correlation does not decrease so markedly or regularly with age in 
one of my cases that he examined...and that no evidence has been given that 
the normal coefficient decreases.” 


The destructive criticism that @ for the Gaussian and for all surfaces of which 
we have any practical experience increases the more one-sided is the dichotomy — 
and Mr Yule thinks nothing of -02°/, dichotomies—is not met at all. The criticism 
was of the method of forming an inference, and not as to whether the inference 
led to a law which could be otherwise substantiated. “There may be, and 
probably is, some decrease of association with advancing age,’ wrote Heron, “but 
the enunciation of such a law on the basis of a number of coefficients of association 
is purely idle.” The truth of the law or its falsity is of no great importance, but 
that Mr Yule should reach it by a fallacious method is of fundamental importance. 
Mr Yule seeks by the words “Dr Heron also objects to my conclusion that 
association decreases with age”—an objection never raised—to confuse the really 
destructive criticism, that @, unlike tetrachoric 7;, having no intelligible correction 
for the one-sidedness of its dichotomies is a function of the dichotomic percentages 
and therefore two @’s based on different percentages are wholly incomparable. 


* Phil. Trans. Vol. 194 A, pp. 309 et seq. 
+ Biometrika, Vol. vir. p. 119. 
+ Loc. cit. p. 637. 
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(c) Fallacies involved in the use of Percentages. Coefficient of Colligation. 


Mr Yule has endeavoured to give his Q a physical meaning by deducing from 
it for the “equivalent symmetrical table” his coefficient of colligation w. For 


such a table 


A Not-A 

B ue | Jad Joc 

ne E | J be Jad 
ee | ven Jbe rf a nf 2 


Vad—Vbe 1 (/percentage of A’s percentage of not-A’s 
ape Vad + BS ia {( which are B’s ) ~ ( which are B’s )t j 
We have given grave reasons for doubting the process by which Mr Yule deduces 
this table from his original data. But this very method of percentages itself 
is liable to gross misinterpretation, and illustrations of this occur throughout 
Mr Yule’s text-book*. 
Given a fourfold table : 


A Not-A 
B ae a b a+b 
Not-B... ‘ d c+d 
| 7 ae 7 b+d N 


a 
a+ec 
a Cc 


= —{~ — —— ‘or 1 t a ° 1 is ak 5 
dn = 100 (“ —) for the horizontal treatment. Which is to be taken as a 


real measure of the relationship? Mr Yule in his text-book uses either and 
apparently has some personal scale of values. He gives no probable errors which 
alone could give any soundness to his discussions. Actually we find 


a a: bd 
P.E. Of g, =67'449 Ves AG (b fae 
ae ie cd 
(a+b) (¢c+d)" 
* Theory of Statistics, 1911. 


ae b 
the percentage difference is q, = 100 ( — ra) for the vertical treatment and 


P.E. of q, = 67449 ty, 
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Now here are a few of Mr Yule’s percentage coefficients : 


(1) Imbeciles and Deaf-mutes (Yule, loc. cit. pp. 33—34). 


Totals 


Imbecile | Non-Imbecile 


Deaf-Mute 451 14,795 Qu=0°877 
Non-Deaf-Mute 32,464,323 | t= 2-809 
Totals= “a. | 
(2) Datura (Yule, loc. cit. p. 37). 
| White | Totals 
Prickly 68 W= - 7°840 
Smooth . 15 dn= — 10°882 
83 | 


Totals 


(3) Houses in course of erection in Urban and Rural Districts (Yule, loc. cit. 
p. 62). 


Built | Building | Totals 
Urban 5010 Q=6'714 
Rural 1761 Gn=0°317 

| Mopale | 6709 | 62 6771 


(4) Eye-colour in Father and Son (Yule, loc. cit. p. 34). 


Father. 


Totals 
619 Jy=36'570 


Light 
381 | In=36°457 


Not-Light ... 


Son. 


Totals aa 2% Bi 1000 | 


(5) Developmental Defects and Dullness (Yule, loc. cit. p. 45, where the 
numbers are reduced to 10,000). 


Without Totals 


| With Defects 
Dull 888 1186 2074 Jv = 33°528 


Not-Dull 22793 24213 9n= 36951 


Totals 
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These instances will suffice, and now let us sum them in a table with their 
probable errors. 


| 
cae | Values of q sia Mr Yule’s Judgment on the Table 
— 
(3), Qn | 0°317 +0163 | “Distinct positive Association ” 
(1), YW 0°877 +0°029 “High degree of Association ” 
CONG | 2°809 +0:093 “ High degree of Association” 
I *(8); Ww | 6-714 +3°40 _ “Distinct positive Association ” 
(2); We ~ | 7°840 +5°77 (ies No Association ” 
(2), d, | 10°882 +7°93 | “No Association ” 
i 16); Ge 33°528 +0°68 “ Very high indeed ” 
| (4) Pp 36°457 +2°05 | “ Shows the tendency to resemblance ” 
(4)> YW 36°570 +2°05 _ “Shows the tendency to resemblance” 
| (5), % 36°951 +0°73 “Very high indeed ” 


Now Mr Yule has used the method of percentages in a curious manner; some- 
times he compares a/(a+c) with b/(b+d) but at other times with (a+ b)/N; 
sometimes he uses the percentages found both ways, sometimes only found one 
way. He has throughout failed to give the probable errors of the differences 
of the percentages, which might have influenced his judgment, but he leaves his 
readers to believe that some inference as to the intensity of association can be 
founded merely upon such relative percentage differences. He indeed tells us (loc. 
cit. p. 651) to distinguish between the intensity of an association and the reliability 
of that intensity, so that we must presume that in speaking of the grade of the 
association, he does not form his judgment in relation to the probable error, 
Now in this table we find percentage differences of 78 and 109 belong to 
tables which in Mr Yule’s judgment exhibit no association, but tables with 
differences of 0°3 have “distinct positive association” and of 0°8 have “high degree 
of association.” One table with a difference of 36 merely shows the “tendency” 
to association ; another with the same percentage difference has association “ very 
high indeed.” For any given table there are six ways in which the difference of 
percentages can be enumerated, namely 


a b Uw atb b atb 
ate b+d’ ate Vii b+d NS 
a C a a+ C G ate 
atb e+d’ ath N ’ c+d N 


Mr Yule sometimes uses one, sometimes another of these methods to reach 
his judgment of the degree of association in a table. He has given us his 
judgment with regard to the association of developmental defects and dullness 
for the partial universe of those without nerve signs (Joc. cit. pp. 45—46), he says 
the association is “very high indeed.” It may according to the percentage differ- 
ence chosen be either 2°92 or 36:96. He has further given us his judgment on the 
association of developmental defects and dullness for those with nerve signs, he 


294 On Theories of Association 


says it is “very small.” It may according to the percentage difference be 5:15 or 
0:96; thus the percentage difference might be higher than in the very “high 
association indeed” case in the partial universe of those without nerve signs. 


Finally in the “high association” case of the total universe, the percentage 
difference might be 1°91 or 51:03. Clearly in judging by percentages the con- 
clusion will depend on which percentage is worked out first. Is it not clear 
that, however Mr Yule may have reached his judgments of no, small, high or very 
high association, the percentage difference is not his actual measure and could 
only confuse the tyro in statistics, for whom he introduces this “simple” method ? 
But if difference of percentage has obviously no correlation with Mr Yule’s 
judgment of these grades of association, what weight can possibly be given to the 
coefficient of colligation in determining association? The chief merit of that 
coefficient according to Mr Yule is that it has—what Q has not—a physical 
meaning ; but for him percentage differences of 1:91 and 36:96 alike mark “very 
high associations” and differences of 0°96 and 10°88 alike mark very small or no 
association. We have no standard and clearly Mr Yule has none of how such 
differences of percentages are to be interpreted. Mr Yule has by his treatment 
of percentages a priort destroyed any rational meaning that could be given to 
his own coefficient of colligation as a measure of relationship. Is not Professor 
Edgeworth’s question answered? The coefficient is a “colligation,” not a “profound 
truth.” Mr Yule obviously lays no consistent stress whatever on percentage 
differences in practice. 


(d) Mr Yule’s use of Pearson’s “Transfer” (ab—cd)/N as a 


Measure of Association. 


Among Mr Yule’s many means of testing association—no two of which give as 
a rule the same result—perhaps the most striking is his use of the “Transfer” 
(ab—cd)/N, to which he gives a new name and letter, the ‘common difference” 8*. 
One of us had already suggested that the “transfer” per wnit of the total frequency 
might be used as a coefficient of association of an inadequate character}. It is in- 
adequate because it makes no correction for class-indices and none for the centroids 
of the quadrants. Thus it does not lhe between 0 and 1, and is largely affected by 
the position of the dichotomic lines. Mr Yule has preferred to use instead of the 
transfer per unit of total frequency simply the transfer, his “common difference 
6,’ and the results add further evidence of the vagueness of the whole of his 
conceptions of association. He tells us, to begin with, that “the difference of the 
cross-products may be very large if NV be large, although 6 is really very small... 
the difference should be compared with J, or it will be liable to suggest a higher 
degree of association than actually exists” (loc. cit. p. 37). To illustrate his 


* Theory of Statistics, p. 36. 
t+ Phil, Trans. Vol. 195 A, p. 14. 
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method he uses some of Bateson and Saunders’ data for Datura from the Report 
to the Evolution Commuttee, 1902 : 


Colour of Flower. 


Violet White Totals 


Prickly 
Smooth 


Fruit. 


Totals 


The difference of the cross products is 252—141=111 and then Mr Yule 
proceeds to tell us that “at first sight this considerable difference is apt to 
suggest a considerable association.” He then divides by 83, and writes: “But 
6=111/83=1° only, so that in point of fact the association is small, so small 
that no stress can be laid on it as indicating anything but a fluctuation of 


sampling” (p. 37). 


That Mr Yule is content with this process is evident from the opening words 
of the following paragraph: “While the methods used in the preceding pages 
suffice for most practical purposes, it is often very convenient to measure the 
intensities of association in different cases by means of some formula or ‘coefticient’.” 
We now know what Mr Yule considers “sufficient for most practical purposes”! 
Here are a few tables to illustrate it. 


(1) 
iopae 
266,374 233,626 500,000 | 
Not-B 233,626 266,374 500,000 | 
Totals 500,000 1000,000 | 
(1) 
| Totals 
lB ~ 934,579 | 31,15: 965,732 | 
Not-B... 31,153 3,115 34,268 | 
Totals 1000,000 


Totals | 
999,450 — 
550 


Totals... 999,450 | 550 1000,000 
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The values of Mr Yule’s 6 for these three tables are 
() 16374, (ui) 1940, (ii) 100. 

If anything is to be judged from these results, (i) is far more highly associated 
than (ii) and (ii) again than (iii). All are far more highly associated than the 
Datura, which we will call (iv), and then the order of association sinks from (1) to 
(iv) in a marked manner. 

Now here are Mr Yule’s coefficients of association put against his és 

6: (i) 16374, (ui) 1940, (iii) 100, (iv) 1°34, 
@: G@) “130; (iv) °282, (ai) 500, (1) 296: 

As the one series goes down the other goes up! What, we ask, can be learnt 
from Mr Yule on the subject of association, when his methods, “sufficient for most 
practical purposes,” thus contradict themselves ? 

Here in (1), (ii) and (iii) we have kept the total frequency constant, but 
perhaps the most absurd side of Mr Yule’s 5 is manifest if we alter the total 
of the observations. Suppose Bateson and Saunders had experimented with 
8300 plants instead of 83, then 6 would have been 134 instead of 1:34, The 
association is of course absolutely the same, but how would Mr Yule interpret 
his two 8s? 

We regret having to draw attention to the manner in which Mr Yule has 
gone astray at every stage in his treatment of association, but criticism of his 
methods has been thrust on us not only by Mr Yule’s recent attack, but also by 
the unthinking praise which has been bestowed on a text-book which at many 
points can only lead statistical students hopelessly astray. 


(e) Mr Yule’s Assumption as to Absurdities which must arise if 
Normal Distribution be applied to the “ Blind.” 

Another interesting fallacy is developed by Mr Yule on p. 638 of his paper. 
He writes: 

“Consider for a moment what the assumption of normality of distribution would imply in 
any case where there is an increase of, say, the blind from one age-group to the next. This 
must imply either (1) a fall in the mean of the assumed variable character, goodness of sight, 
I suppose—if the standard deviation is constant or falling, or (2) an increase of the standard 
deviation if the mean is constant or rising. If the first occurs, then there must be some people 
in the later age-group who are much more blind than any people in the first, and fewer people of 
first-class sight ; if the second, there must still be some people in the later group much blinder 
than any in the earlier, and there will also be some of much better sight. On the assumption 
that lies at the base of the normal coefficient, you cannot, in fact, effect a change in the 
numerical proportion of A’s without changing them qualitatively at the same time. The 
assumption seems to me absurd, to be equivalent in this case to saying that there are certain 
people entirely deprived of sight in the first age-group, and certain others more than entirely 
deprived of sight in the second. The normal coefficient is accordingly inapplicable, and its 
precise values of no special significance.” 


We have rarely come across a more specious fallacy. If it were true it would 
be impossible in practical statistics to represent both a population and a selected 
sub-population by normal curves. Let the original population be NV, mean M, 
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and standard deviation &, and let the selected population be n, mean m, and 
standard deviation « Then the second curve will always pass outside the 


first if 
n> 


(m — My +2 (32 — 0) log, (Fz)>°. 

This will happen of course if }=o, Mr Yule’s first case. In this case the 

curves cut on one side only of the means in the point 
L=4(m+M)+ SS o. 

In the case of blindness we should have m>M*, and all it would signify 
would be that after this value, #, of badness of sight, the older age would have 
for each grade of bad sight more individuals than at the lesser age; since the 
two curves both extend to infinity there is no question, as Mr Yule suggests, of 
persons being “much more blind” than at the younger ages. Mr Yule is simply 
confusing in his own mind or in the minds of his readers two senses of “ more 
blind,” i.e. more blind persons of each grade, and blind men of greater degree 
of blindness than actually can occur at the younger age. If he merely means 
to say that the Gaussian extends to infinity in both directions, that is a very 
old objection on the theoretical side to the curve ; it has little value in practical 
statistics, where there is a reasonable approach to normality. 


If m= M, then we have 
/Q>2 nS 
eo=M+ on/ 22, {loge (oN) = log, (n>)} 


ot — >? 


It is necessary therefore for real roots that o>, if oV be >n&. This is 
Mr Yule’s second case, and this would give increasing numbers of persons in 
each grade of good sight beyond the value of # from M for increasing old age. 
We should have thought that this was a very improbable state of affairs, as it 
is almost certain that sight deteriorates with age after childhood at least. If we 
take the 1891 census data+ used by Mr Yule, we have: 


For ages : sa 4S—B5 55—65 
| 
Males in general ... 1,191,789 770,124 
| Blind males eae el 1,752 1,905 
| 


If we suppose in these cases m = M, we find X/> = 2.974 and w/o = 2810, and 
since the dichotomy at “blind” is the same for both curves, we must have X =, 


* We have taken positive axis towards worse sight. 

+ Vol. 111. pp. v. and lvii. Mr Yule has clubbed together those blind from childhood and the 
numbers, 12 times as great for these years, not blind from childhood. That so many acquire blindness 
indicates what a range of graduated sight there must be unless we suppose blindness to arise in- 
stantaneously. 
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or ¢=1:058S. Thus a six per cent. increase in o suffices to provide the increased 
blind population at the greater age; the two curves intersect at « = + 2°968c, or 
somewhat beyond the “blind” boundary. For all grades beyond this there will 
be more persons of each grade of bad sight. There is nothing inconceivable or 
improbable in this; but, on the other hand, there would be beyond # = — 2:968c, 
on the good sight side, a number of grades of better sight with more people at the 
older age. This, of course, is not impossible, but it seems far more reasonable to 
suppose the average sight to grow worse with old age, and in addition to change 
its variability somewhat. If the variability remained the same, as we have seen 
under the first case, there is no excess in the grades of marked good sight in the 
population of older ages. Let us now consider what happens if the mean be 
shifted and the variability increased. Taking m> M*, we have 


x&— Im = (m— My oe > aN oy m—M 
7 SR), 2 emt . Fen 
(Oye a yar a 


= lo ——————e 
> 8e Sno? — >? > 


— — 


Now let us take o= 101%, and therefore for the “blind” groups at 45-55 and 
55-65 we have as before X = 2°9740%, # = 2'8105c = 2'8386% to fix the dichotomy. 
Hence 

(m — M)/>=(X —2)/> = 13538. 
Thus we find (& — m)/o = + 2°6738 or — 162791. 


The former value shows that from some little distance outside the dichotomic 
line each grade of bad sight and blindness has more individuals of that grade at 
ages 55-65 than at ages 45-55. The latter value indicates that the point of 
intersection of the sight curves for the two ages on the side of good sight takes 
place at a point so extremely distant from the average sight that not a single 
individual would occur with such sight in a population many thousand times 
greater than the actual population. 


We think it most probable, however, that a third case, not even referred to by 
Mr Yule, best describes what actually takes place—namely, that the sight at the 
older age gets worse and is less variable, not more variable. To illustrate this, 
take o='99%, then X =2:9740, «= 2'81050 =2°7824. Hence 

(m — M)/X% = '19158, and (#—m)/o = + 24742 or + 16°5875. 

Thus the older age curve now never cuts the younger age curve of sight on the 
side of good sight at all. It cuts on the side of bad sight twice, once somewhat on 
the good sight side of the “blind” dichotomic line, and on the other occasion 
immensely beyond the limits of the populations in question. In other words, 
the older ages have fewer members in each grade of good sight aud more members 
in each grade of very bad sight. This appears to us a perfectly reasonable state 
of affairs, and of course extends far beyond the ratio selected for o/. 

It will thus, we think, be clear that had Mr Yule attempted to turn his 
half-baked notions into figures before he expressed them in words, he would 


* The positive direction of the variate is towards bad sight. 
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have realised that more people in the more blind grades or more people in the 
better grades of sight are not the same thing as some people “much more blind” 
or some people of “much better sight” than any in the earlier aged groups. 
Mr Yule’s alternatives are not real alternatives, he makes no reference to a shift 
in mean and a decrease in variability; such a combination involves only a reduction 
of the numbers in the grades of good sight, a very reasonable result with increasing 
age, and an increase of the numbers in the grades of bad sight, also a very 
reasonable hypothesis. Towards the ‘tails’ of both age curves, theoretically 
there would be fractional units, while in actual observations there would be 
isolated units at relatively wide intervals (cf. Galton’s “ Difference Problem *”). 
What the distribution of such units might be, could not be a priori predicted. 
But it is quite possible for two distributions with slightly separated means and 
slightly different variabilities to give quite reasonable fits to Gaussian curves 
and yet the distribution with the greater variability to have no outlying units 
with “much more” of a character than any which exist in the less variable 
distribution. The variabilities are much more closely determined by the bulk 
of cases with moderately large deviations than by the one or two extreme 
outlying individuals. 

In the case we have last discussed the age-group 45-55 has two individuals 
lying outside 4°653, and the distance between the means being “19S this gives 
446% = 4510 for the corresponding distance on the 55-65 age curve. Outside 
this limit are 2°5 individuals of this older age curve. Are we to say that that 
half individual represents Mr Yule’s necessity for some people in the later age- 
group who are “much more blind” than any people in the first? In truth some 
people would be much less blind, if they would only stay to express their opinions 
in actual numbers before writing them down. The “ minute sifting of numerical 
results” is the foundation of all true statistical inference, and here, as in other 
phases of his recent work, Mr Yule has committed himself to superficial statements 
reached by verbal disquisition which vanishes into nothingness if the touchstone 
of numerical investigation be applied to it. 

There are many other points at which we should like to traverse Mr Yule’s 
statements, but we think we have brought forward enough evidence to indicate 
how unreliable are his methods and how biased are his criticisms. 


(16) Swmmary of General Conclusions. 


In order to sum up the general conclusions reached in this paper, we must 
state first one or two principles which we accept as almost axiomatic : 


(i) There is no universal method of dealing with an n x n-fold table, except 
the method of mean square contingency, leading to a probability measure of the 
independence of the two characters, unless we know: 


* Biometrika, Vol. 1. p. 390. 
38— 2 
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(a) that both characters proceed by discrete units, and are tabled as such. 


In this case the method of pseudo-ranks is identical with that of the product- 
moment, and there never has been doubt as to how the table is to be treated; or: 


(b) that the frequency of the two characters is continuous and that this 
frequency follows or approximates to a definite theoretical system. There is only 
one such frequency system, which has up to the present been effectively discussed, 
ic. that of Laplace, or as it is more frequently but less justly called, that of 
Gauss*. If the distribution be Gaussian or approximately Gaussian, there are 
many ways of dealing with an n x n-fold table. 


(ii) A majority of the cases which occur in statistical practice are so close 
to the Gaussian distribution that methods based upon Gaussian theory will give 
useful first approximations, i.e. correlations within +05 say of the true values. 


Years ago one of the present writers insisted on the non-Gaussian character 
of many variables. But he also remarked on the large number of variables 
which can be described with sufficient practical accuracy by a Gaussian 
distribution. 


The present discussion demonstrates that even with distributions markedly 
skew the Gaussian theory, if applied to 2 x 2-fold tables—without extreme 
dichotomies—will give results not differing by more than ‘05 from the value of 
the true correlation and often differing by much less. Roughly, we may say 
that for reasonable divisions, the divergence between the true correlation and 
that obtained by Gaussian theory is hardly ever of practical importance and indeed 
in “populations” of the size usually dealt with rarely exceeds twice the probable 
error. 

(ii) The coefficient of correlation has such valuable and definite physical 
meanings that if it can be obtained for any material, even approximately, it is 
worth immensely more than any arbitrary coefficients of “association” and 
“ colligation.” 

Starting from these principles we ask ourselves to what data Mr Yule proposes 
to apply his three processes ; 

(a) The Boas-Yulean ¢ for fourfold tables. 
(b) The coefficient of pseudo-ranks. 
(c) The coefficient of association or that of colligation. 


We have shown in this paper that for tables with a finite number of cells 
of the order 5 x 5-fold to 8 x 8-fold, the method of pseudo-ranks must lead to a 
value below and often 40 °/, below the true correlation of variates. Mr Yule has 
stumbled into a statistical pitfall, for he has neglected the fact that correlation of 
ranks is not correlation of variates, and that his correlation of ranks would still 
have to be corrected for the class-index correlations, i.e. he has also neglected the 


* Of course both these writers only dealt with the frequency of one variate; Bravais extended it to 
two, but gives no admissible proof of his formula, which he practically gets by analogy. 
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existence of huge brackets. We may, we hold, entirely dismiss from statistical 
practice this method of pseudo-ranks, except for the case wherein it has always 
been used, i.e. for discrete unit variates, classified by units, where it coincides with 
the usual product-moment method and needs no special name. 


We have only to note here that Mr Yule uses the method of pseudo-ranks, 
which we hold to be demonstrably false, to make very sweeping charges, which 
can be and have been met, against the pigmentation work of the Biometric School, 
He not only suggests that the workers on the pigmentation data were foolish, but 
that they were dishonest. That is the sort of attack which usually recoils on the 
head of the man who makes it, especially when he has for several years worked 
in the Department against which he prefers the accusation. As a matter of fact 
the non-Gaussian character, the variability of tetrachoric 7;, for different divisions 
was recognised very soon after it had been applied. But the investigations then 
made and more amply illustrated in this memoir indicate that the values originally 
given were substantially correct, the inheritance of intensity of pigmentation 
between parent and offspring hes between 46 and ‘50; it is not of the order 4 
as Mr Yule asserts on the basis of a theory which we feel convinced he will have 
to withdraw, if he wishes to maintain any reputation as a statistician. 


We have shown in the course of this memoir that the coefficient of asso- 
ciation Q, if treated merely as an undefined measure of association, has not 
for varying dichotomies the stability of the tetrachoric coefficient and it appears 
to have no reasonable physical meaning even for the cases which he has selected. 
Mr Yule has deduced from it a second coefficient, that of colligation, which has, 
he says, a physical meaning, when the table has been dressed in an artificial 
manner, namely it signifies on this artificial table the excess of the percentage 
of A’s that are also B’s over not-A’s that are also B’s. We have shown from 
Mr Yule’s own writings that such a difference of percentages has in his own 
practice no meaning at all from the standpoint of association. 


Mr Yule never tells us clearly when we are to use one or other of his co- 
efficients. He spends 16 pages of his memoir on discussing the application of 
his coefficients of colligation and association to the vaccination data; yet on p. 611 
he writes: “For discontinuous attributes—attributes proper, as we might term 
them—the true correlation is that given by formula (24) or (26) [1Le. the Boas- 
Yulean coefficient or Pearson’s ¢]; we are dealing with a variable in fact, which 
can only take two values as distinct from a variable exhibiting a normal or any 
other continuous distribution. Tables I, III and IV [ie. the vaccination data, 
as it seems to me, represent precisely such a case.” Here Mr Yule has given up 
colligation as applied to vaccination ; if so why devote 16 pages to its discussion ? 
But 20 pages later Mr Yule tells us that: “For investigations on smallpox and 
vaccination such as those of Brownlee and Macdonell and Turner, the use of Q or 
w would, in my opinion, have been more illuminating as well as simpler than the 
use of the normal coefficient” (p. 631). 
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Mr Yule does not really appear to know which of his ducklings to prefer, 
even for what in his estimation—although not in ours—are discrete attributes. 
We are quite clear that none of them are appropriate. Still less are they 
appropriate for the definitely continuous data to which Mr Yule and his disciples 
apply them. 

The controversy between us is much more important than an idle reader will 
at once comprehend. It is the old controversy of nominalism against realism. 
Mr Yule is juggling with class-names as if they represented real entities, and his 
statistics are only a form of symbolic logic. No knowledge of a practical kind 
ever came out of these logical theories. As exercises for students of logic they 
may be of educational value, but great harm will arise to modern statistical 
practice, if Mr Yule’s methods of treating all individuals under a class-index as 
identities become widespread, and there is grave danger of such a result, for his 
path is easy to follow and most men shirk the arduous. 

The very large amount of arithmetic involved in this paper would have been 
impossible without friendly help from a number of our colleagues; we have 
especially to thank Miss Julia Bell and Mr Herbert E. Soper; the former for 
much work in calculating and the latter for diagram draughtsmanship as well as 
calculation. Miss Ethel M. Elderton also most kindly undertook one or two 
pieces of heavy arithmetic. We can hardly hope to have escaped numerical slips, 
but we feel confident that such slips, if they occur, will as frequently tell against 
us as for us; and we have not, knowing the fallibility of the best calculators, 
done more than draw attention to points where our arithmetic differs from that 
of Mr Yule. We have laid sole stress on errors of interpretation and on fallacious 
theory. 


APPENDIX I. 


ON THE FALLACY OF ASSERTING PERFECT ASSOCIATION WHEN ONE 
QUADRANT IN A FOURFOLD TABLE IS VACANT. 


In all the values of the probable errors hitherto determined the constants 
of the distribution found in the formula are truly constants of the actual 
distribution of which the population under discussion is a sample, Because we 
do not know the actual distribution, we replace its constants by those of the 
observed sample. This method will as a rule not lead us astray, but it may do 
so grievously in cases where the observed frequencies take limiting values. For 
example, consider the population described in the following fourfold table: 


TABLE I. 
| a eee ce orale oe 
Beet ae: 971,138 994,000 
Not-B... 5,862 6,000 
Totals 977,000 23,000 1000,000 
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and let this distribution be the real population or an absolutely proportional sample. 

: : a|b 
Now for this population with the scheme ala’ we have ad — bc =0, and all our 
measures of association vanish. If we take a sample of n’ from such a population 
of n, the value » =(ad — bc)/n? will not for the sample be zero, but, if 


‘= (a'd’ —b’c')/n®, 


where a’, d’, b’ and c’ are the values in the sample, will have a standard 
deviation * 


(a+ b)(a+c)(d+b)(d+c)_ 1 
i ee — “xX —. 
n Vn 
Now let us compare this with 
ye by(atce)\ad+v)(d'+c) 1 
0, = 74 x =F ) 
n Vn 
the value of o, derived from the sample itself. Now to units the value of the 
sample might be 


TABLE IL. 


Not-B ... 


Totals 


and it is clear that o, and a, will be exactly the same, as they depend only on 
the marginal totals. Thus 


1 
Oy = Oy = = V'994 x 977 x 006 x 023 
1000 


= 000366. | 
But 7’ =-000138. Accordingly (ad — bc)/N? when d = 0 is not significant, having 


regard to its probable error, and the association is zero. On Mr Yule’s theory 
Q=—1 and its probable error is zero. 


Now the standard deviation of Q, for @ zero, is 


i (a+c)(c+ d)(b +d)(at+ b) 
Vn’ (ad + bey? 4 


for a sample of n’, where a, b, c, d refer to the original population. 


Let us consider the differences for the above material which will arise from 
calculating og on the population in Table I and on the sample in Table II. On 
Table I we find »cog=137; on Table II ,og’ = 2°62. Hence either Table is 


* Pearson, ‘‘On a novel method of Regarding the Association of two Variates,” Drapers’ Research 
Memoirs, Biometric Series, No. vit, Dulau & Co., 1912, p. 7. 
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absolutely compatible with Q being zero. It is true that the value of og as 
estimated from the sample is nearly double what its value would be truly esti- 
mated from the population, but both values suffice to show that Q=0 is as 
reasonable an hypothesis as to the constitution of the material as Q@=—1, and 
far more reasonable than Q=—1+0. The method of mean square contingency 
gives us y°='1455, leading by Palin Elderton’s Tables to P=-971, or if the 
material were truly independent only in three cases per hundred should we get 
a better fit than Table IT to Table I in taking samples of 1000. 


The fact is that in drawing random samples of 1000 from Table I, the 
distribution of frequency in the cell d is given by 


(999862 + -000138)"" = ‘8712 + 1202 + 0086 (for all terms beyond second). 


Hence in 100 samples of Table I, d would be zero in 87 cases, unity in 12 and 
greater than unity in about one case. In other words Mr Yule’s Association 
Coefficient would for material with true zero association be —1 in about 87 °/, 
of cases and of the order +°75 in 12°/, other cases. In all these cases, however, 
the probability method shows practical independence. It must accordingly be 
recognised that it is extremely dangerous, if zero frequency be found in one 
quadrant, to assert that Q=+1+0. For, a population of zero association would 
give such a value in 87°/, of samples of 1000 in a case like that under consideration. 
The tetrachoric method fails also for this extreme case, for the simple reason 
that the very continuity of the method excludes thinking in isolated units. 
If the real population were that of Table I, then on the basis of continuity we 
should expect ‘138 from the infinite skirt of the Gaussian surface in quadrant d. 
But we might extend this volume of the skirt up to something under ‘5 before we 
should anticipate a whole unit to appear in d. There is no suggestion of this 
kind about Q, for Mr Yule directly discards all conception of such continuous 
frequency surfaces. This point must be borne in mind in applying tetrachoric 7% ; 
a quadrant of zero frequency does not necessarily signify that in the theoretical 
Gaussian distribution this quadrant would have zero frequency. We may equally 
reasonably assume it anything up to ‘5. In such a distribution as 


TABLE IIL. 


| 
| A Not-A 
| 
| 
3 Ree 21 450 
INOt= Bi... 529 (0) 


Totals 550 | 450 


we shall alter the tetrachoric coefficient from negative unity to a high value 
somewhat less than negative unity, by inserting anything up to ‘5 in quadrant d, 
but we shall not swing the correlation from —1 through 0 to a small positive 
value by the process. We shall get a reasonable minimum value for 7;, and we 
shall be convinced that 7;=—1 + 0 is not necessarily a true representation of the 
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state of affairs, although the correlation is negatively high. But in such a table 
as Table II the whole character of the correlation is changed by inserting a 
frequency w, less than ‘5, in this d quadrant. 


The first question to be answered is: If we put a small frequency a in d, how 
are we to take it from the other quadrants? The only reasonable answer here 
seems to be: “Take it in proportion to their frequencies.” Thus Table II 


becomes of the form: 
TABLE IV. 


oo ' & 
6 ( = im) d “+6 (1 a Tee) 


av we 
Til == er Oo Ieee ( 
977 (1 700) an (1 7000) ae 


Now can we choose w so that ab —cd =0, for this Table? For this we must 


have: 
Se pe eee) von 2) 
71 (1 = a955) X 2-23 (1—go99) * (1 aH90)> 
which leads to «='1421. 

This shows us that on the Gaussian hypothesis a value of « absolutely 
consistent with a zero being recorded in that quadrant leads to zero association. 
This for most practical purposes would be sufficient. But a little further con- 
sideration here is desirable; we may write the table in the form: 


TABLE YV. 
970:8620 — a | 22-9967 +2 | 9938587 
599924 ‘1491 —2 61413 
9768612 | 23-1388 1000 


Here «=0 gives a table of zero association, a ='1421 gives, if we can only 
record to a unit individual, the table of experience, i.e. 


TABLE VI. 
971 | 23 994 

6 0 | 6 
977 | 23 | 1000 


Now 7 of Table V =— q000" Therefore the probable error of «= 1000 times 


that of ». If we find ,o, for e=0 in Table V, we have for its value ‘0003714, 
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quite close to the value we obtained for Table I from which we originally deduced 
Table II. Hence the probable error of « ='67449 x 1000 x oo, ='2505. It will 
be clear therefore that with such a probable error for x, the frequency of the 
d quadrant might actually be anything from 0 to ‘5, and yet the Table of 
experience takes the form VI, when we can record only by units. 

Suppose d had been *5, «= — ‘3579, then we have for the theoretical table: 


TABLE VII. 


971-2199 226388 | 9938587 
56413 5000 | 61413 
| | 
1000 


976°8612 | 23°1388 


which solved by the tetrachoric process gives 
1 = + °224 + 187, 
i.e. the value of 7; is not significant*. 

On the other hand if we put d=0, i.e. deal with Table VI, tetrachoric 7, is 
unity, because on the Gaussian hypothesis only complete association is compatible 
with absolute zero in this quadrant and then any sample will exhibit absolute 
correlation. But, as we have just seen, the zero in quadrant d could arise from 
samples of a population in which d was small but not zero, and that in a particular 
case 87°/, of samples of a material with zero association would show this zero 
quadrant. The accompanying diagram gives the values of tetrachoric 7; for 
various hypotheses as to The dotted rectangle is bounded by the lines which 
give vertically twice the probable error of # for « =0, and horizontally twice the 
probable error of 7; for true 7,=0. We have also placed once the probable error of 
tetrachoric 7; from the plotted 7, on either side giving the broken curve. It will 
be seen that from the case of d='5, 7, =+'2244°187 up to d=‘0025 and 
r, = —°400 + °779, there is no significance in the values of 7; found by the tetra- 
choric process. Even after this we cannot assert that the values of 7, obtained 
would be significant}; for the proof of the formula of the probable error of 7, 
depends upon éd being small as compared to d. Now og=Vd(1—d/N)=Vd, 
very nearly when d is very small. Hence 8d/d is of the order Vd/d = 1/Vd which 
will be large if d be less than unity}. The probable error of 7; = 1 is only zero if 
d is absolutely zero for the population which is being sampled and not if it is 
merely zero in the sample. But we only know that population through the 
sample, and we see that in such cases as we are considering the zero in the 
sampled population is only likely to occur in a small proportion of the cases dealt 
with. The tetrachoric process clearly fails im such cases, but we see that with 


* If we apply Q we have Q= 584+ °332, pointing rather more in the direction of significance, but 
such application of Q is illegitimate, as the fractionising of the theoretical surface has no meaning for a 
coefficient which is based on complete neglect of the nature of the frequency-distribution. 

+ In fact the tetrachoric 7, series-equation rapidly becomes divergent and the formula for the 
probable error takes an indeterminate form. 

+ The like failure occurs in Mr Yule’s proof for the probable error of Q, although he has not 
warned his'readers of this; it is accordingly not applicable, if one quadrant has zero frequency. 
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any reasonable assumption with regard to a, the tetrachoric process suggests not 
perfect but zero association; the perfect association is only reached as a limiting 
case, although Mr Yule’s coefficient gives an unhesitating unity*. 


6B 


© (o¥) 
Values of 'Positive r, 


, 


-P.e. of r-% -Pe. of'r-> 


Pa 


, 


TA 3 
Values. of od 


(o>) 


o------> 


, 


----Twice RPE.or rn 
of Negative Fr 


' 
ow.” 


Values ~* 


* In the Table selected by Mr Yule to exhibit the variations of his coefficient of association (Phil. 
Trans. Vol. 194 A, p. 277) 51 out of the 164 values shown are +1 owing to the occurrence of a zero 
quadrant. With the introduction of a ‘5 into this quadrant—quite reasonable on the basis of a con- 
tinuous variate—his values may be reduced from +1 to +:72 or swing over from —1 to +:98, ete., 
etc.! What is the value in such cases of the statement that there is “ perfect association’’? 


39—2 
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We have seen that if the sampled population be really one of zero association, 
the sample may have in quadrant d in 87 °/, of cases zero frequency, but in 12°/, 
of cases unit frequency. This unit must—working to unit individuals—be with- 
drawn from a,b andc. The most probable case is the addition of a unit to both 
aand d and their withdrawal from 6 and ¢, or again we may draw units respectively 
from a, b or c. We may consider only these four cases: 


(a) 972 | 22 | 994 (8) 971 | 22 | 993 
5 | 1 6 Gulla 7 
977 | 23 | 1000 977 | 23 | 1000 
(y) 971 | 23 | 994 (5) 970 | 23 | 993 
tlie 6 6 | 1 7 
976 | 24 | 1000 976 | 24 | 1000 


These lead to 


a B y 6 
= a | a eae 
Value of tetrachoric 7 ... ‘394-15 | ‘36416 | 384-15 | 35+ -16 
p.E. of 7,=0 for same marginal totals... 00 + °27 | 00+ °25 | 00+ -27 | 004-25 
Value of Association QY . ‘80 E14) 7616 ‘78+°13 | -75+:16 
p.b. of Y=O0 for same marginal ‘totals | 00 + '23 | 00+ °24 | :004+°23 | 00+°25 


It will be seen that 7; is always less than 2°5 and Q always greater than 
45 times the probable error. The value of 7; obtained is always less than 
15 times the probable error of 7,=0 for the same marginal totals, while Q is 
3 times the probable error of Q=0 for the same marginal totals. Thus the 
tetrachoric method warns us that the association is probably zero, while the 
association coefficient emphasises a high value of the association. 


To sum up: The correct process in these cases with zero in one quadrant 
is not to assert that the association is perfect with Mr Yule, but to apply first the 
probability test and determine whether the material may not rather be a random 
sample from an original population of zero association. The failure of the tetra- 
choric 7; in these cases is rendered evident in the working, we reach non-convergent 
series and are thrown back on a limiting case; if we place in the zero quadrant a 
small frequency less than 0°5, which would correspond to zero in the actual table, 
we find a finite value of r;, but one non-significant having reference to its probable 
error, unless we approach close to the limit of d= 0, in which case the probable 
error of 7, is so far undetermined, because the ordinary process fails to be valid. 


Mr Yule’s view that association is perfect when there is zero in any quadrant 
ignores the fact that he can only deal with a sample of the true population, and 
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so far as we are aware he has not warned students of the extreme danger of his Q 
in these cases. The non-applicability of tetrachoric 7; to cases in which one of the 
quadrants has zero or small relative frequency is, on the above and on other 
grounds, well-known to those who habitually work with it, and the rule has long 
been to avoid where possible all such extreme dichotomies. This point has been 
largely disregarded by Mr Yule in his criticism of tetrachoric methods. 


APPENDIX II. 


On THE TEST OF GOODNESS OF FIT OF OBSERVATION TO THEORY 
IN MENDELIAN EXPERIMENTS. 


An objection to the y?, P, test of goodness of fit, has recently been raised with 
somewhat unconscious humour by certain ardent Mendelians. A simple illustration 
may be taken. Suppose the mating to be 


(DR) x (DD) =50°/, (DR) +.50°/, (DD). 


For example, let there be 1000 offspring and let 480 of these be recognised by 
later experiment as (DR)’s and 520 as (DD)s. Then the standard deviation is 


41000 x4 x4=15'8 


and the observed deviation is 20, and P=about ‘90, or the fit is quite good. 
But suppose the observations are 


480 (DRYys 519(DD)s and 1(RR) 
Le. Observation: 480 519 1 
Theory : 500 500 0 


observation — theory) 
y= gf Semen atta aa ry) = 


Clearly eheoRy OD, 


and P = 0, or the probability of observation being a random deviation from theory 
is zero, there is absolute badness of fit. Hence either theory or observation is at 
fault. It is not, however, the theory of “goodness of fit” which fails in such a 
case, but the Mendelian theory which wants mending. If we put a@ black balls, 
b white balls and c red balls in a bag, the theory will tell us whether any sample 
of black, white and red balls can be reasonably considered as a random extract 
from that bag. But if we are presented with a series of black, white, red, and 
green balls and asked what is the probability that these were drawn from that bag, 
we must assert that it is zero, however concordant with theory the results for the 
black, white and red balls alone may be. There are only two courses open to the 
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experimenter who gets green balls however few in number out of a black, white 
and red ball theory : 


(i) to assert that they are anomalies due to his temporary colour-blindness, 
to mistakes in his observations, or to some misinterpretation of his results, which 
are not accurately describable in terms of his original categories, or 


(ii) to amend his original theory by inserting some green balls a posteriori 
into the bag and starting afresh to calculate his probabilities. 


The one course weakens the weight we must lay on his record or on his choice 
of categories; the other tends to discredit his a priori theory. To adopt a third 
course and assert that we want a new test for “goodness of fit” of theory to 
observation, which shall cover such discrepancies, ie. which shall slur over 
divergencies between a priort theory and a posterior: results, may appeal to 
our sense of human fallibility but scarcely to our appreciation of scientific logic. 
We shall be left with the suspicion that the theory is plastic and the observations 
elastic. What criterion of “goodness of fit” can the theory of probability provide 
when it is a case of applying plastic theory to elastic observations? The answer 
surely is none whatever until the plasticity of the theory has been quantitatively 
studied, and until the errors of the record have been quantitatively stated. Hither 
we must be told that the observer will mistake a red ball for a green one in 
so many per cent. of cases, or we must be told that the theory will be inaccurate 
in so many per cent. of cases. Personally we think it possible that all attempts to 
find a “good fit” of a plastic theory to elastic observations are idle. It is a con- 
sideration of the green balls, which are said not to be in the black, red and white bag 
at all, which is often the basis of marked scientific progress. That atmospheric 
nitrogen differed from pure nitrogen was just such a “ green” ball; but the plastic 
theory that air consisted of oxygen and nitrogen only had been confirmed by many 
elastic observations before Lord Rayleigh followed up his “ green” ball. 


These remarks are suggested by the following paragraphs in two recent 
Mendelian publications. Dr Raymond Pearl writes* concerning Mendelian 
data : 

“A determination might be made of the ‘goodness of fit’ of theory to 
observation by Pearson’s method, were it not for the fact that that method 
cannot be applied to cases like the present.” 


Dr Pearl says it cannot be applied because he finds green balls, where his 
theory puts only black, white and red into his bag. It is either his observation 
record or his Mendelian theory, not the mathematics of “goodness of fit,” which 
needs modification. Dr Pearl continues in a footnote as follows: 


“The difficulty hes in the fact that Pearson’s test depends upon a variable 


v=s me na ml 


mM, 


* The Journal of Experimental Zoology, Vol. x11. p. 2038. 
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where m, is the theoretical frequency and m,’ the observed. Now obviously in 
any distribution where even one m, is zero, the value of y*? must be infinite 
whatever may be the values of the other m,’s or m,’s. That is, if the theoretically 
expected frequency on any base element is numerically zero, the probability against 
the whole curve becomes infinite. Thus, for example, suppose a system of 
frequencies like the following, a type which is continually arising in Mendelian 
work : 


Class di 2 3 4 By) 
Theoretically expected frequency ... 595 827 68 0 96 
Actually observed frequency be 594 828 67 1 96 


“ Now it does not need a mathematical measure of any kind to tell one that in 
this case the theoretical and actual distributions are in very close agreement. 
Yet because the theoretical frequency on class 4 is zero, the probability by 
Pearson’s test is literally infinite against the observed distribution being regarded 
as a random sample of a population distributed in accordance with the theoretical 
frequencies. Pearson has indeed himself noted what is essentially this same 
difficulty in using the test on ordinary frequency distributions*.” 


Now what does this paragraph exactly signify? Interpret it in coloured balls ; 
white, red, black and yellow balls are placed in a bag in large numbers in the 
proportions of 595 : 827: 68:96. There are no green balls in the bag. One 
such green ball is said to have been drawn. ‘Theory says it is an impossibility, 
and the criterion of goodness of fit says its improbability of occurrence is infinite. 
We can conceive no logical theory doing anything else. Dr Pearl does in fact tell 
us that cases in which a ball, classified as green, comes out of the theoretical 
white, red, black and yellow ball bag are of “a type which is continually arising in 
Mendelian work.” This at any rate is a frank admission. As a matter of fact 
with his arbitrary division between “over 30-egg” and “under 30-egg” hens, 
we are not surprised that “green” balls appeared not only in ones, but in twos 
and even in fours, and in a few cases to even more, although this is attributed to 
“ physiologically extremely favourable” matings (loc. cit. p. 248) as apart from 
gametic theoryt. What Dr Pearl is seeking is a plastic theory or an elastic 
record, not a real criterion of goodness of fit, which must give no finite proba- 
bility when green balls come from a bag which contains no green balls! Dr Pearl 
continues : 


“The point noted obviously limits greatly the applicability of Pearson’s test, 
and in a most unfortunate direction. Tests of goodness of fit are much needed in 
Mendelian work [we cordially agree !]._ But it is just here that the classes where 


* This is a complete misunderstanding. Pearson says that you must not in the case of continuous 
variation make use of classes which theoretically have each less than unit frequency, where the record 
goes by individual units only. 

+ Until Dr Pearl publishes the actual record of each bird, and not merely its class-index, and the 
same for its ancestors, it is impossible to estimate his degree of justification for the theoretical 
treatment of his results. 
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the theoretical frequency is zero often occur*.” In other words, observation 
records something as occurring which existing theory says cannot occur, and 
Dr Pearl asks for a criterion which shall make the impossible only mildly 
improbable. He must either remould his theory or explain away his observations. 
We see no alternative. 


Dr Pearl’s demand for a criterion which shall not be crucial, but allow elastic 
records to fit a plastic theory, is well illustrated by the following paragraph from 
the work of another American Mendelian who is convinced that “feeble-mindedness” 
is a Mendelian unit character, but has found his “green ball” in the normal 
offspring of two feeble-minded parents: 


“These two are apparent exceptions to the law that two feeble-minded parents 
do not have anything but feeble-minded children+. We may account for these 
two exceptions in one of several ways. Either there is a mistake in calling them 
normal, or a mistake in calling the parents feeble-minded; or else there was 
illegitimacy somewhere and these two children did not have the same father as 
the others of the family. Or we may turn to the Mendelian law and we discover 


* The remainder of Dr Pearl’s paragraph runs: ‘‘ To determine the probable error of the individual 
frequency in measuring the goodness of fit of Mendelian observation and theory, as was first practised 
by Weldon, and later by Johannsen and by Mendelian workers generally, does not appear to the writer 
to be an altogether sound procedure. It fails to take account of the correlations in errors amongst the 
several frequencies. Yet these are just as important and just as certainly existent in a Mendelian 
‘category’ type of distribution as in the ordinary variation polygon of a continuously variable 
character....Pearson’s test covers this point, and were it not for the other difficulty noted above 
would be much more widely useful in Mendelian work than is actually the case” (loc. cit. p. 204). 
The ‘‘ widely useful”? test in Mendelian work is quite obviously one which will overlook negation of 
theory, or not drive the observer back to question the validity of his records or his categories. But 
there is a misstatement in the above sentence which needs correction. If there be only alternative 
categories, e.g. the total of (RR)’s compared with the total of (DR)’s+(DD)’s, then Pearson’s test is 
absolutely identical with the probable-error test. This is of course well recognised; for, if m; and mz be 
the observed frequencies and n; and ng the theoretical frequencies, 


5 (my— 24)? (Mg— No)? if oe 
= iy =a =e 
x Ny e ng u Z i) Ny ay Ng 


(m=)? (mg— Ng)? 


m ( -3) ng (1-3) 


(Deviation of either category)? 
(Standard Deviation)? : 


, Where N=n, +79, 


ao 


and for this case P= all ~ 2X" dy. This test therefore Weldon applied with perfect legitimacy 
2a 


e 
x 
to the consideration of Mendelian quarters. When Weldon came in the very paper cited by Dr Pearl to 
test more complex Mendelian results, he did not fail to take account of correlations in errors, and 
actually applied Pearson’s criterion (Biometrika, Vol. 1. p. 235). Dr Pearl’s sentence therefore requires 
remodelling; he has clearly failed to appreciate what Weldon was doing. 

+ The pedigrees published by Weekes and Goddard show other exceptions to Mendelian law, which 
they appear to have overlooked. This is confirmed by similar pedigrees in our Laboratory. The very 
idea that the continuous and highly variable character ‘‘feeble-mindedness” is a ‘‘unit character” 
in the Mendelian sense will do much to check real research into the grave complexities and difficulties 
of this very vague and broad category. 
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that according to that law there might be in rare instances such a combination of 
circumstances that a normal child might be born from two parents that function as 
feeble-minded. For practical purposes it is, of course, pretty clear that it is safe 
to assume that two feeble-minded parents will never have anything but feeble- 
minded children *.” 


The italics are ours. They are very typical of the manner in which an elastic 
record and a plastic theory are made to fit. No account is anywhere provided 
of this extension of Mendelian theory which “in rare instances” provides a normal 
offspring to the two parents with an abnormal dominant character. Possibly, as 
in Dr Pearl’s case, it is due to the occurrence of “physiologically extremely 
favourable” matings. Anyhow the old definite simplicity of Mendel’s Mendelism 
has gone; with an elastic record and a plastic theory any data may be Mendelian 
—or not—according to the views of the investigator who moulds his theory and 
stretches his facts. 


How welcome to such an one must be the Yulean theory of association ! 
“ Whatever the nature of the classification, however, natural or artificial, definite or 
uncertain, the final judgment must be decisive; any one object or individual must 
be held either to possess the given attribute or not” (Yule, Theory of Statistics, 
p- 9). , 
In the face of such a direction, how could Dr Pearl have been so foolish as to 
balance a number of his hens on the dichotomic fence of 30-eggs, and allow a 
moiety of each such hen to possess one Mendelian attribute and the other moiety 
its alternative ? 


It is not difficult to understand, however, why Dr Pearl does not like Pearson’s 
criterion of the goodness of fit of theory and observation. On p. 255 of his 
memoir he gives a table “showing the observed and expected distributions of 
winter egg production for all matings taken together.” He remarks on this 
table that “the lumped figures do not give an altogether fair estimate of the 
matter, but some sort of a summary is necessary.” We agree very cordially 
because a number of the impossible green balls of the subsidiary tables do not 
appear as such when the tables are lumped, but taking the data for what they 


are worth we have the following four series: 
Winter Egg Production 


———— 


Over 30 Under 30 Zero 

I. Observation ... re 365°5 259°5 31 
Theory as is 381°45 257°25 17°30 

Il. Observation ... ine 2 23 15 

Theory Fc see i) 25 15 

III. Observation ... ais 36 79 8 
Theory — wee 26°5 86°75 Pris 

IV. Observation ... m. 575 98°5 23 
Theory sity res 68°60 95-0 15-40 


* The Kallikak Family, p, 114. 
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Pearson’s criterion gives the following results : 


I. -P=<003 
Ii ==000 
Ill, P=-113 
LV ee "056 


The odds against the first series arising from material following the theory are 
332 to 1; the second series is impossible on the theory; the odds against the third 
series are about 8 to 1; and against the fourth series are about 27 to 1. The com- 
bined odds against even the three series (I, III and IV) representing the theory 
are very large indeed. Dr Pearl actually tells us that “the investigator is usually 
expected to reject abnormal material” (p. 256). And he prides himself on not having 
done so*, and asks us to form a judgment not on the summary but on the detailed 
data in the body of the paper. We have done so, and the criterion gives still worse 
results. We agree with Dr Pearl that “the high producing hen, somewhat like 
the race horse, is a rather finely strung, delicate mechanism, which can be easily 
upset, and prevented from giving full normal expression to its inherited capacity 
in respect to fecundity” (loc. cit. p. 255). But surely this is only to admit that 
the character chosen was wholly unfit to test the theory upon at all? It does not 
justify rejecting the only scientific test of “ goodness of fit,” and then concluding 
from nothing other than general impression that “the cumulative probability 
that the hypothesis applied represents at least a reasonable approximation to 
the true interpretation of the results becomes very great” (loc. cit. p. 257). If 
“cumulative probability” signifies anything at all, it means the theory of 
probability applied to the series to deduce combined odds against the total 
results and these are hopelessly against Dr Pearl. Further we cannot go until 
Dr Pearl publishes his record, which is not yet before us, although he has 
published his own interpretation of it in a great variety of journals. 


APPENDIX ITI. 
ON THE EQUATION TO THE SURFACE OF CONSTANT Q. 


If the limits to the frequency range in the variate # be a and a’, and in the 
variate y be b and 0’, then in the notation of p. 184 


a fb 
Neng = | zdudy, 
aJsy 
a fb a b 
P -| | zdudy, a=| | zdudy, 
a/b aJy 


en AB Nyg 
dady~ 
* « But in view of the rather hysterical attacks upon geneticists and their method of work in this 
country, if for no other reasons, it seems best to follow the plan of publishing all the data.’ We 
would remind Dr Pearl that this is exactly what he has not done. We require the quantitative record 
of every individual hen and its ancestry as far as is known before we can fully test the validity 
of his results. 
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By the equation in the second line of p, 184, we have 


d= x) p+ QtN Vin -C- xy) (pt OPh+ 4x0 —x) pa 
Nong = TL) 4 


Whence by straightforward but somewhat laborious differentiation we find 


He go 2(1—) py} 


This is the equation to the surface of constant @. As soon as we know the 
nature of the marginal frequencies, i.e. the values of p and q, we can find the form 
of the surface. The above equation is somewhat simplified if we refer p and q 
to the medians of their frequency-distributions, i.e. write p=}N—a, q=4N—-B. 
In this case 


da dB le tee = 
= FE es Ce = 21 4) as} 


(XN = (y= 1 [a+ B= x (@= BY} 


If the marginal frequencies are Gaussian, 


N i! = N a 
fo eee e 201" d : = — | é 2 02 dy, 
V2r0,/0 B V 2m! 0 
1 22 Ly? 
da  N a9 @_ Nyasa 
da V2, > dy Vr op 


It is therefore possible by aid of Sheppard’s Tables to construct the contour 
lines of the surface of constant Q for this relatively simple case. But the surface 
is far from simple and its complex equation seems to indicate that association as 
measured by @ is of a very arbitrary character. We have constructed the surface 
of constant association for the special case of Gaussian marginal frequencies, when 
Q='6. The photograph of the surface, the regression lines and the contours will 
be published on another occasion. It suffices here to note: (1) that the arrays are 
heteroscedastic, varying from homoscedasticity of the mid-section to a skewness of 
‘16 when w/o, = 1°5 and to a skewness of ‘20 when 2/o,=3°5. (ii) The regression 
line is most markedly skew, in shape like a Galton ogive, so that there is a 
maximum of regression, and therefore correlation, at the centre of the surface, 
while the regression and therefore correlation reduce to zero as we move outwards. 
No frequency surfaces in actual practice exhibit, as far as we are aware, these 
features demanded by constant Yulean association. 


40—2 


MISCELLANEA. 


I. The Correction to be made to the Correlation Ratio for Grouping*. 


By STUDENT, 


Using the ordinary notation viz. n,,=the number in the w array of y’s whose mean is at “p, 
Yry=the mean of this array, V the total number in the sample, and 7 the general mean of y, 
we have 7” defined by the relation 
S {nap Yap — ¥)"} 
Neji eeutuetar demo (1). 


9 


i= 


If n? is required to fit a regression curve to the actual observations as in Professor Pearson’s 
original memoir “On the General Theory of Skew Correlation and Non-linear Regression,” no 
correction is necessary. 

But if we require a ratio which shall remain constant under wide variations of grouping 
and of number in the sample and which shall consequently be more comparable from one sample 
to another, there are two corrections to be made. 


The first of these has already been given by Professor Pearson (Biometrika, Vol. vu1t, p. 256), 
and he has expressed it as follows :—If 7? be the value of 7? actually found by the use of (i), 
and 7? be the value which would be found from an infinitely large sample, then if x be the 
number of w arrays 


But there is a further effect of grouping which has not hitherto been noted and which can be 
evaluated as follows: 


Suppose the wp, array to be divided into elementary wv arrays and let y, be the mean of 
the w, elementary array and n, its frequency. 
Then clearly the proper contribution of the wp array to 7? is 
Sir (Yo- Y} 
Nie ; 
This is equal to 
S| Rp Yg—Y+Yp 7 Yap} 1 
ae = 
No, No, 


LS {0p Gey — 3 +28 {iy Dep D) Yo — Fap)} +8 {ty (In — Fag) V1 


Now ¥,,—¥ is of course constant for this summation, S (m)=Nz, and S {np (Yo — Yap)t =9; 
herefore the contribution to 7? 
Nay (Yary— Y)” S {np Yr— Yup) aa 
=— No? ar Vege Ga (ii). 


* See above p. 118 of this Journal. 
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The first of these two terms is that which is obtained in the ordinary way, so the contri- 
bution of each array should be corrected by the addition of the second term and 7? itself by 
the addition of ee 

ea ae 
No,” 

Now if Professor Pearson’s correction (ii) has been made we may take the point whose 

coordinates are (2), y,) to lie on the regression line, and if further we assume the regression 


ci é ad 2 o 
line to be linear throughout the x, group and to be inclined at an angle of tan-1!7, —’ to the 


Ox 
horizontal we have 
a ey St ee a ign OU. 
Yo= "p+ %p and Yup=%p Tp =, 
a a 
Hence (iv) becomes 
» 20S, > w.\2 
g 1° S {Np (Lp— Xp)*} 
5, oe Ea (v). 
x 


Now S {n, («)—%p)*} is the second moment of the vp») group about its own mean and when the 
distribution is known can often be approximately evaluated. Similarly when the distribution is 
known 7, can be estimated and the correction to 7? calculated group by group. 


But by making certain assumptions we can very much simplify the work, and a practical 
test, in which the assumptions are not justified, will show the sort of errors which are 
introduced. 


The first assumptions are that the regression is linear and the arrays homoscedastic. In 
this case of course 7 is constant and equal to 7 ; we are practically determining a value of 7 by 
the » method. 


The correction then becomes 


Foe SUS tm (%— a9) 


or writing P@=S[S {mp (vp) — Zp)?}] and H? for the raw value of »? after using Pearson’s correction, 
we get from (iii) n?= 7?+7?d? or 
pee 
doa 2) eiolejecelele(ats iota e’are'uYo elafeisleveleis\gie eiee eleieteisisisieie sceaiarsiece (vi). 


To obtain a value for d? we still require to postulate something of the nature of the distri- 
bution and I propose to treat (i) of the case where the unit of grouping is constant and small 
enough for the frequency in each group to be considered to be distributed as a trapezium, 
and (ii) of the case where the frequency distribution is normal. 


(i) First to find the second moment of a trapezium about its mean. 


Let z, and zy be the ordinates forming the ‘walls’ of the trapezium and let the group 
unit be A. 


Then yaa (5%) w is the equation to the ‘roof’ referred to the ‘floor’ and left hand 


‘wall’ as axes. The area is clearly ee a 


The mean is at 


{(“ —2Z,) h3 wt = h Qe +25 


eae |, 978 ce (fore) 
h (Zg+ 24) je ; ~ h (Zg+2y) 3h 2 3° ag Ze ; 
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The second moment coeflicient about the axis of y is 
2 I 2 Cet. FETE PL) Gyn fia 
ee AS ii ge { 25) hi i a Bey + on 
h(zgtZy) Jo * h (z+ 2) 4h 3 6 2,42," 
The second moment coefficient about the mean is 


MW (Qey+2,)? _ fet eeetal i? fi 4 esa) 


(25 + Z,")? 12 EaNZaetew 


Mt Beg hey MO 
6° 242 9° (a +29)? 18 


9 


Clearly when h is reasonably small (2) is a quantity of the second order and in 


F “$s “3! 
this case 


IN [gtr csr eeeetcstcs cae seecrecceen ace (vil), 
so that 
_» kal 
2 1 | Lemos 
a ee ie TG [ tereeeeesteessteseeeseeceaon ens (viii), 
126,” | 7 N 


when the unit of grouping is uniform and small. 


(ii) When the unit of grouping is neither uniform nor small and there is no special know- 
ledge of the nature of the distribution, we must needs fall back on the Gaussian curve to give us 
a first approximation to z, and z, for each group. 


In this case 


and it is necessary to determine it, after fitting the frequency by means of Sheppard’s tables. 
Finally, what correction, if any, is to be made for the grouping of y ? 
This will become more apparent from the alternative formula for y?, namely 


as S(y =, Ys)” 
No, 


ip! 


12 
For the second moment of each array should be corrected by the subtraction of ae where & 


is the unit of grouping of y so that 


coe NE 
S(y- Fr — 75 
a TS 


_SY-IotIe-D?-S- HP 
No, 
LSU = HET = 9.) Gs - NSH MSY Hy! 


No, 


_ Sts Ya= YP} 
No,’ 2 


since S(¥,—y)? when summed for each individual becomes S {n,;(¥;— Y)*} when summed for 
each array, and S (y—¥Y,) (y;—Y) vanishes for each array. 

Hence there is no correction to be made for the y grouping except Sheppard’s correction for 
the Standard Deviation of y. 


* The suggestion of this formula I owe to Professor Pearson. 
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I have tested the results on an instance given in Professor Pearson’s original memoir, namely 
the age and auricular height in Girls, correlation table pp. 34 and 54. 


The means of the arrays in the full table are as follows : 


Sonn | UB eEe 
Grouping Age Mean Auricular| Number of 
ss Height Cases : — 
16) ag aE LN NA a 
{| — Cae, 115°25 | 1 | 
| Melia pa 116°9643 7 | | 
| a 117°4722 18 | 
1 yt AG yy 119-1000 40 | 
=) es 120-3026 76 | 
{ = ao) 1216340 125 » | 
= 9—-10 121-7246 177 (| 
SS 011 122-8160 235 1) 
ee 123-1427 261 —% | 
] } Sees 123-8908 309 aie? 
AI aby) 124°8622 263 == |) S21) 4 
ti} — | 4-15 125°7146 198 -- 
(| — | 15—26 1261565 214 — j 
{ = 162717 126°5340 162 | I 
SNe 218 126-9132 95 
Sh Weis 127-0205 7 | | 
(|) — | 19—20 129°5577 13 | } 
] Meee 27 123-8214 ie Ma 
} == |) 2122 126*5000 Sa er | | | | 
=|, 22. 23 125°25 2 ia 
| 
| | | 


These were grouped in seven ways in three of which the groups were of equal width, and the 
other four give an attempt at equal frequency: the method of grouping is set out by means 


of columns headed in Roman numerals. 


The age distribution differs significantly from the 


normal, the constants being 8; =°0013, 8,=2°7101, but it would perhaps have been better to 
have selected a less normal distribution : still it represents the ordinary ‘cocked hat’ statistics 
that tend to occur. 


The regression is certainly not very linear, the growth apparently ceasing at about 18-19. 


The values of 7? (the raw value), H? (the value after using Professor Pearson’s correction) and 
7? (the value after attempting to use the A? correction) are given in the following table. 


| B he 1—2 from 
Number Number 7 (= 190 ») Normal Curve 
of of n° n HH? | ue 
Grouping Groups ‘ = a 
| n n v” n | 
I 20 | 09183 | -303 | 08414 -98489 | -291 “08494 “291 | 
II 10 ‘08657 | °294 | -08290 “08595 | °293 08510 "292, 
Ill 5 ‘O7701 | *278 | -07535 | 08786 | -296 "08635 | -294 | 
IV 9 08836 | 297 | °08510 = — _ *08953 “299 | 
AW 6 08342 | -289 | :08136 = — | ‘08913 299 
VI 5 ‘08218 | -287 | ‘08053 | — == | “O8885 | *298 | 
VII 9) 06203 | -249 | :06159 — — ‘09739 | -312 | 
| 
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It will be seen that the first three, with even grouping, are very close together though 
the number of groups has been reduced from 20 to 5. Similarly the next three are close 
together, and the last is again by itself. 


An examination of the way in which the groups are taken shows that the more the tail 
is bunched together the higher is the value found, and this is what would be expected in this 
particular case, since there is practically no increase of head height with age at the ‘old’ end of 
the scale, whereas for purposes of calculation we have assumed a constant angle for the regression 
line. But it may be pointed out that 7 varies (to the 2nd place of decimals) only from ‘29 to ‘31 
even if we reduce the twenty groups to two, an extreme proceeding which is never done in 
practice. 

At the same time the ordinary six or eight groups may be expected to give results a little 
too high when, as is usual, the regression line is curved. 


II. On the Hereditary Character of General Health. 
By KARL PEARSON, F.R.S. and ETHEL M. ELDERTON, Galton Laboratory. 


(1) In dealing with the heredity of general health we have to meet at once certain funda- 
mental difficulties. We have first the question of environment and secondly the question of 
variety in health caused by what we may term accident. If we deal with families living in 
widely differentiated environments we shall have, or certainly may have, a spurious correlation 
of health in parents and offspring; the resemblance in health will be emphasised. On the 
other hand, when a single member of a family is exposed to a specially differentiated environ- 
ment, i.e. goes to the West Coast of Africa, or spends his life in India, or catches enteric at a 
particularly unfavourable moment, the correlation of general health may be decidedly weakened 
in the case of parent and offspring. These difficulties of differentiated environment and what 
we may, perhaps, term accident cannot be wholly overcome, but we may endeavour to meet 
or measure them. In the first place we can confine our observations to one social class and 
thus go a long way to get differentiated environment removed. If, as in the present paper, we 
deal essentially with the professional classes, there is great uniformity of general environment. 
The food supply is sufficient, the doctor is always at command, physical exercise is fairly general 
and markedly insanitary houses or occupations are practically avoided. We do not think 
therefore that, for the data of the present paper, differential environment is a marked factor 
in producing correlation. On the other hand we do consider it possible that “accident” will 
weaken the relationships sought. The reduction in health-correlations below the values for 
other physical characters, might indeed be taken as a measure of random action on health, 
comparable with the random action of death itself in reducing the correlation of duration 
of life, which has already been discussed by one of us*. Indeed heredity of general health 
is almost as significant for the problem of natural selection, as heredity of duration of life. 


A more serious difficulty in the health-inheritance problem is this very question of death. 
If parents are delicate and health or delicacy is hereditary, they will have delicate children, 
and we may anticipate that more of these children will die than in the case of the children 
of robust or normally healthy parents. Thus only the healthier children of delicate parents 
will survive for us to record their state of health, and accordingly the offspring of delicate 
parents will appear healthier than they really should be owing to the selective death-rate. In 
our investigations we have dealt only with offspring who lived to be adult, ie. to at least 
21 years of age, so that an appreciation of general health could be formed. There is also a 
further difficulty that very delicate parents are themselves likely to die, and we have again 


* R.S. Proc. Vol. 65, p. 290, and Biometrika, Vol, 1. p. 50. 
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less chance of getting their record. Hence the problem is by no means without difficulty, but it 
is of such great interest that we venture to give here our chief results. 


Our health classes were: Very Robust, Robust, Normally Healthy, Rather Delicate, Delicate 
and Very Delicate. These have been verbally defined, and our present definitions are as 
follows : 

SCALE. 


V.R. Very robust.—He has never had to see a doctor, nor been off work through illness. 

R. Robust.—He has only seen a doctor about minor ailments, and has only been off 
work for colds, etc. 

N.H. Normally healthy.—He has not had more than one serious illness, involving, say, 
a fortnight’s absence from work during the last ten years. 

R.D. Rather delicate.—He has had more than one serious illness, but not more than one 
involving more than four weeks’ absence from work during the last ten years. 


D. Delicate.—He is off work through illness at least four weeks in all every year. 
V.D. Very delicate.—He is in a chronic state of ill-health. 


But these definitions were not used throughout the whole of the records included in the present 
investigation, and we found considerable reluctance to the use of the “ Very Delicate” category. 
Accordingly in the present reduction all the delicate categories have been clubbed together. 
As in previous investigations a Gaussian frequency scale was used for Health*, the interval 
on the scale covered by ‘Normally Healthy” being taken to represent 100 units of health, 
which may be called sanitaces. These hundred sanitaces were supposed to represent the range 
of normal health of each type of individuals—fathers, mothers, sons and daughters, and the rest 
of the distribution calculated in terms of them. At the mean of the parental category was 
then plotted up the mean number of sanitaces of all the offspring of either sex of parents of 
the given category. Thus the round black dots of our four first diagrams were obtained. It 
was found that the three points marking the mean health of offspring of “Very Robust,” 
“ Robust ” and “ Normally Healthy” parents were in all four cases closely on a line which passed 
through the mean of parental and filial health, but that the mean health of children of delicate 
parents in all four cases gave a point lying markedly above this line. This line in all four cases 
shows a marked slope indicating that as the health of the parents is worse so the health of their 
offspring is worse. 

To illustrate the probable source, or at least part source, of the anomaly in the case of the 
delicate parents, the percentage of sons dying before 21 years of age and so escaping record 
of health was obtained for each health group of the fathers and this percentage was plotted in 
Diagram V. to a Gaussian scale of Father’s health. Again we note a fairly uniform increase of 
the percentage of deaths with decreasing goodness of health of the father, for the three better 
categories, but a very marked excess of deaths—quite off the line—in the category of delicate 
fathers, where some 13 °/, of sons die as against the 7 °/, in the case of “ Very Robust” fathers. 

(2) Still preserving our Gaussian scales, we may consider that it is the sons of delicate 
fathers who have been especially selected. In this case the health of the sons of the three 
left-hand categories of fathers will give very nearly the true regression line for unselected 
material. We have made no attempt to algebraically fit the best lines to these three points, 
but placed them graphically on the points and through the general means t. 


* Cf. Huxley Lecture, Biometrika, Vol. 1. p. 146. 

+ This was done because the three left-hand points always fall nearly on the same line with the 
general means, but of course the means of both parents and offspring would be somewhat lowered had 
we the full complement of delicate sons of delicate parents:—to increase the number of such sons 
would ipso facto increase the number of such parents and thus both means of health in offspring 
and parents would be lowered in the same direction probably approximately that of the true regression 
line. 
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But there is another disturbing factor, which the diagrams make at once obvious, the mean 
health of parents is in very considerable degree better than that of the offspring. Measuring 
our mean health of any population in sanitaces from the division between “Robust” and 
“Normally Healthy,” negative towards normally healthy, we have the following results : 


TABLE I. 
Mean Healths of Various Populations. 
A 

Fathers of Sons : + 2°72 sa. 4°81 
Fathers of Daughters: — 2°09 sa. 
Mothers of Sons: — 20°22 me _ 949 
Mothers of Daughters: —17°73 sa. if 
Sons of Fathers: —17:11 sa. _0:29 
Sons of Mothers : — 16°82 sa. é 


Daughters of Fathers: — 35°53 al _0-89 
Daughters of Mothers: — 34°64 sa. 


These results show at once that the health of the parents is far better than that of their 
offspring. This does not imply that the younger generation has degenerated but only that 
there is a selection of the more healthy for parentage ; the more robust men and women are, 
the more likely they are to be parents and repeated parents. The differences between sons of 
fathers and sons of mothers is due to a difference of material and the same applies to daughters 
of fathers and daughters of mothers ; these differences are probably those of random sampling. 
The differences in the cases of parents are more marked and possibly significant. If they be, 
then we have some slight suggestion that a healthy father and a delicate mother would be more 
likely to have sons and a delicate father and a healthy mother daughters. Even if there be 
anything in the suggestion, it would only be shown in large numbers, and is not a universal rule 
for individual pairs. All we can say is that the numbers do not flatly contradict a popular 
impression of the kind. 


(3) Another result brought out by our numbers is that the health of the male is markedly 
better than that of the female in both generations ; this is possibly the eftect of a more stringent 
selection of the male. Such a selection may be of two kinds, first the known heavier death-rate 
of the male, and secondly a greater objection to admission of delicacy or even to giving a record 
at all on the part of the delicate male. The general difference can be seen in the following 
percentages of delicate individuals : 


TABLE II. 
Percentages of Delicate Individuals in Various Populations. 
Fathers of Sons: LES hn 
Fathers of Daughters: 9°68 °/.. 
Sons of Fathers : 13°69 °/,. 
Sons of Mothers : 13°57 °/.. 
Mothers of Sons : 16°65 °/.. 


Mothers of Daughters: 16°56 °/.. 
Daughters of Fathers: 20°44 °/.. 
Daughters of Mothers: 20°82 °/.. 


There is an increase in the delicacy rate as we pass from Fathers of Sons to Fathers of 
Daughters ; a slight fall only in the delicacy rate as we pass from Mothers of Sons to Mothers 
of Daughters. But both are supported by the rates of “ Very Robust” where we find the 
following percentages: Fathers of Sons 18°76; Fathers of Daughters 17-91; Mothers of Sons 
11°28 ; Mothers of Daughters 13-02, 
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Diagram V. 
) 


PERCENTAGE OF SONS DEAD. 
wy 
to} 


15 


20 1 
175 150 125 100 75 50 25 0 -25 -5SO -75 -100 -125 -150 -175 
V.ROBUST. ROBUST NORMALLY HEALTHY - DELICATE. 

(4) In the following table the Standard Deviations of the various populations are given also 

in terms of sanitaces. 
: 
TABLE III. 
Standard Deviations of Health in Terms of Sanitaces of 
Various Populations. 


Fathers of Sons: 72°25 sa. 
Fathers of Daughters: 75°31 sa. 
Mothers of Sons: 82°40 sa. 


Mothers of Daughters: 84°65 sa. 


Sons of Fathers : q 
Sons of Mothers : 75° 
Daughters of Fathers: 7 
Daughters of Mothers: 80:42 sa. 

The definite conclusions we can draw from this table are: first, that women are definitely 
more variable in health than men, and secondly that daughters seem less variable in health 
than their mothers, while sons are possibly but only slightly more variable than their fathers. 
On the whole the nature of the health selection between the older and younger generations 
seems to be of a character which leaves the variability only slightly modified, but shifts very 
considerably the mean health as a whole from parents to children. 
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(5) The results so far reached enable us to test the degree of stability of our scale. If that 
scale is a reasonable one, the range of the “ Robust” category ought to come out with reasonable 
sameness when measured in sanitaces for the different populations. We find : 


TABLE IV. 

Range of Robust for Various Populations. 
Fathers of Sons: 66°80 sa. 
Fathers of Daughters: 67:02 sa. 

Sons of Fathers: 74°80 sa. 
Sons of Mothers : 76°95 sa. 
Mothers of Sons: 79°61 sa. 


Mothers of Daughters: 77°52 sa. 
Daughters of Fathers: 81°54 sa. 
Daughters of Mothers: 82°47 sa. 


It will, we think, be clear from these results that the terms employed in our categories have 
been used in rather different senses when applied to males and females, and when applied to 
the younger and older generation. If we assume “ Normal Health” to be the same for all types, 
then the category ‘ Robust” has been used in a wider sense for women than for men, and in a 
wider sense for the younger than the older generation. Nor does this seem unreasonable when 
we compare ordinary practice, in which undoubtedly a different health scale is applied to men 
and women, and to old and young. As we have seen there is on an average a lower state of 
health in women than in men and we are apt to judge by deviation from the average rather 
than by absolute condition. Again the average health of the older generation is higher than 
that of the younger, and one is rather apt to compare the health of the offspring with that 
of the parent instead of applying an absolute standard to both. Anyhow without laying much 
stress on the reasons for the personal equation, it is probable that judgment does differ in the 
matter of health according as we are dealing with man or woman, and with the old or young 
generation. 


Speaking in quite round numbers we may say that the range of our “ Robust” in women, 
regardless of whether they are parents or not, is about 80 sanitaces and this is almost equal to 
the standard deviation of their health (mean of four populations 81:4 sa.). In the case of men 
of every status, the range (71°39 sa.) of the “ Robust” is slightly less than the standard deviation 
(747 sa.), but for many purposes it may be sufficient to consider both as 75 sanitaces. We 
have no means of ascertaining the absolute health in sanitaces of any individual, but if we were 
to assume 300 sanitaces as the stock of an individual on the border of “ Normal Health” and 
“ Robustness,” the average man would have about 292 and the average woman about 273 units of 
health ; one woman in a thousand would have less than 29 units of health, and one man less 
than 67. The “ Very Robust” man would be a man with more than 371 units and the “ Very 
Robust” woman would be a woman with more than 380 units. Finally one man and one 
woman also in a thousand would have more than 517 units of health. Thus while the most 
robust men and women in the thousand are of the same calibre, the most delicate woman 
has less health than the most delicate man,—a result possibly of the more stringent death- 
rate ; a man needs more health to survive at all. Of course these results are purely suggestive, 
but they flow with some probability from the lower average health of women and their greater 
variability. We should not desire to place any great weight on them. Other data for different 
age groups and social classes will be discussed later, and then it will be more possible to propose 
with greater certitude a definite health scale. 


(6) Looked at from the average environment of the professional classes, there can be little 
doubt that our diagrams indicate that general health is a hereditary character ; but we have of 
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course to regard the difficulty of the record in regard to delicate offspring of delicate parents: 
We want if possible to get some measure of the correlation in health of parent and oftspring 
from our first three points in each diagram. 


Clearly we can find the slopes of the regression line passing through these three points. If 
oo be the standard deviation of offspring and op of parents, the slope 


‘) 
S= —XfYop. 


This of course is independent of normality. Again the weighted mean square deviation of 
arrays has the value og?(1—7op?). Let the 8. D.’s of Very Robust, Robust, and Normally 
Healthy be o4,, oa,, and o,,, then 
Ny Orn +2 ous +73 Or xg 


9 
Ny +N, + Nz 


oo" d = Top’) = 


2 2 2 
l-rop? ™% Og + Not (ota 23 Tas 


Top" (2 +2y+ 23) op? X 8? 


Now o,., o7., Go, Can for each array be expressed in terms of 4 the 100 sanitaces of the 
a9 Fag) Tag 


“Normally Healthy ” range for that array and s can be carefully measured on the diagrams. It 
only remains to consider what value shall be given to op. Undoubtedly some parents are 
omitted because they have died from delicacy, but on the whole we are convinced that there has 
been rather less selection of parents than of offspring. Accordingly we have put op its value 
in terms of the range of ‘“ Normally Healthy.” Thus rgp can be calculated, without regarding 
the final anomalous array. 


But clearly we have to correct the result for our grouping in arrays of parents, but for 
parents only, as the 8. D.’s of the arrays have been found from total frequencies of the groups, 
on the assumption that each array is normal. We must therefore divide each correlation by 
the correlation between class-index and individual character—a point discussed in another 
paper (see pp. 116 and 134 above). 


These corrective factors, 7,@_ of our notation, are : 
x 


rc, for Fathers in Fathers and Sons : "9258. 
% 5 Fathers and Daughters: -9333. 
9 Mothers in Mothers and Sons: 9354. 
55 = Mothers and Daughters: +9384. 


Thus we have 


Correlations of Health, Parent and Offspring. 


Raw * Corrected 
Father and Son : “4456 4813 39 
Father and Daughter: 2852 sabe we 
Mother and Son: “3551 3796 | 37 
Mother and Daughter: 3407 3631/7" 


Mean = "3824. 
We have also considered the correlations from another standpoint. We have for the slope s 
of the regression line for the three first points 
grap, 
op 


* Corrected of course for defect of delicate offspring of delicate parents, i.e. found from formula 
for rop above. 
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Therefore : 


o 
rop=— Xs. 
90 


Now op may be slightly too small because some delicate parents will be omitted and oo will 
probably be too small because, as we have seen, many delicate offspring escape record. We 
shall thus get rather too large values of 7p. There will be no correction to be made this time 
as the values of op and og are based on frequencies occurring between certain limits. The 
following values were obtained : 

Slopes Correlation 


Fathers and Sons: 55 5247 49 
Fathers and Daughters: 33 3184 : 
Mothers and Sons: 345 3758 37 
Mothers and Daughters: 34 3578) ° 


Mean = '3942. 


This confirms the values previously obtained and we think we may adopt them and especially 
the mean* value ‘3824 first reached as at least a fair approximation. Now suppose the 
chance of an individual’s health being due to some other cause than heredity to be p, then in 
a population of WV pairs the chance of both pairs having their health as a natural inheritance 
will be (1—p)? and the ratio of correlated to the total material will be (1—p)? V/J, this 
will measure the reduction in the correlation of health between parent and offspring due to 
accidental and extraneous causes, 


Now the full strength of heredity for physical characters has been shown in tne professional 
classes to be on an average about ‘46: see Biometrika, Vol. i. p. 357. Hence we have : 


(1 —p)?="3824/-46, 
One p= 0743. 


Thus the health of only 7:-4°/, of the population is determined by accidental causes. In 
92°6 °/,, i.e. in the great bulk of cases, heredity is the chief factor in the determination of general 
health. Without laying too much stress on the actual figures, we think it may safely be con- 
cluded that heredity in the professional classes is the chief source of the good or bad health of 
individuals. 


Tables. General Health. 
TABLE I. 


Father's Health. 


| Very Normally ees Ae 
al Rebiret Robust Healthy Delicate | Totals 
= | Very Robust ... 209°5 
a | Robust a 555°5 
a | Normally Healthy 843 
a Delicate 255 
| 
MN 

| Totals 


* There is something anomalous in the high value of the father and son correlation, but we have 
not been able to trace it to any definite origin. 
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TABLE II. 
Mother’s Health. 
Very | | Normally | r : 
4 Robie Robust Healthy Delicate Totals 
$ | Very Robust . 76) 405 42 27 192°5 
an Robust sae 55°5 | 215°75 201°5 72°25 545 
a2 | Normally Healthy 61 189°75 424°5 134°25 809°5 
5 Delicate os i 9°5 66°5 102°5 64°5 243 
NM et 
| Totals 202 «= | «5195 )=— |S 7705 298 1790 
TABLE III. 
Father’s Health. 

a le i al 
“a Very Normally | + ee 
3 Robust Robust Healthy Delicate | Totals 
x Very Robust . 53°5 33° 25°5 
“2 Robust 117°5 207 149 26°5 500 
& | Normally Healthy 110 245 °25 462°25 95 912°5 
oa Delicate 66°5 114 165°75 49°75 396 
A Totals 


TABLE IV. 


Mother’s Health. 


Very 
Robust 


Robust 


Very Robust . 


Robust 76 191°25 
Normally Healthy 85°5 213 
Delicate aa 31 95°75 


Daughter’s Health, 


Totals 


Normally 


Healthy 


30°5 
164°25 
431°75 
157°5 


Delicate 


Totals 


136°5 
489°5 
861 
391 


1878 


Biometrika 1x 


42 
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III. Note on the Honduras Piebald. 
By KARL PEARSON, F.R.S. 


The first report of this piebald with photographs was, I believe, brought back to Europe 
by M. le Comte Maurice de Périgny. Professeur R. Blanchard published an account based 
on the Comte de Périgny’s data in the Bulletin de la Société franeaise a’ Histoire de la Médecine, 
t. IX. p. 213, Paris, 1910. Through the courtesy of the Comte de Périgny I have been provided 
with copies of his photographs, which will appear in the Second Part of the Monograph on 
Albinism soon to be issued by E. Nettleship, C. H. Usher and myself, and are reproduced as 
Plates X. and XI. here. This piebald boy is of much interest because he belongs essentially to 
the “classical type,” illustrated in paintings of the 18th century, and no living piebald of this 
type had so far come to our knowledge, although we had in the First Part of our monograph 
given many illustrations of early cases. 


But there is further scientific interest in this piebald because he supports the point of view 
emphasised by me in the monograph referred to, that when a pure race is crossed with a mixed 
race—a hybrid between races with markedly different degrees of pigmentation—then piebalds 
are likely to appear de novo. On this ground it seems to me idle to speak of piebaldism as 
a Mendelian unit character, and it is idle equally to talk of it as a latent unit character, for there 
is no evidence at all that it ever occurred before in the pure races whose crossing leads to, 
these piebalds. In the present case the mother has Mexican and negro blood, the Mexican 
being already a mixture of Spanish and American Indian. She thus combines black, red and 
white races. The father is a coal-black pure African. But several of our piebald pedigrees show 
that it is sufficient for piebaldism when a pure red or white is crossed with a pure dark race 
and then the hybrid be mated again with either pure race, the produce of this second cross may 
be a piebald. The light race may be merely an albino variety of the dark race, and in several 
of our dark race pedigrees we find such piebalds occurring in stocks wherein albinism has 
occurred also. Indeed given an albino occurring in a dark race it seems possible from the 
hybrid between normal and albino by crossing again with normal or albino to produce almost 
every shade of colour as well as every variety of piebaldism. If such cases occur in man and 
axolotl, it seems unnecessary to seek in the ancestry of the albino for possibilities of either 
piebaldism or colour, as for example has been done in the case of mice. There is no reason 
to suppose brilliant colours latent in the normal axolotl, nor piebaldism latent in the normal 
man. At any rate where there is no evidence of it before the crosses take place, it is more 
reasonable to suppose it a product of the crosses themselves. 


Without entering fully into a matter which will shortly be discussed at length elsewhere, 
I would point out that when a hybrid is formed between a black dog which has bred true for 
many generations and an albino of another race, this hybrid is either black or black with white 
markings on chest, never in our experience so far a true piebald, but when this hybrid is crossed 
again with the albino, we obtain at once not only black dogs, or black dogs with white markings, 
but black and white piebalds, lilac and white piebalds, rusty black dogs, red dogs, and albinos; 
possibly as the work goes on other types will appear also. Much the same changes seem 
to arise from like crosses not only in mice and axolotl but also in man. 


In the case of our present piebald we have the father a pure coal-black negro and his 
brothers and sisters are like him, the mother is a mixture of negro and Spanish-Indian blood ; 
she has a fair skin, black eyes and long black hair, and brothers and sisters are fair skinned like 
her. There are six children; the three eldest are boys aged 11 years, 9 years, and 7} years 
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Lisbey, the Piebald Boy of El Cayo, British Honduras 
K. Pearson, through the courtesy of Robert H. Franklin, Esq., July, 1912. Plates 
VIII and IX compared with X and XI indicate that the frequency and relative 
size of the colour patches appear to have altered. 


Vrom a photograph taken for 
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Lisbey, the Piebald Boy of El Cayo, British Honduras. From a photograph most 
kindly provided by M. le Comte Maurice de Périgny. Lisbey is shown with his 
Father. Front view, 1908. Reproduced from Pearson, Nettleship and Usher's 


Monograph on dAlbinism, Part I. 
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Lisbey, the Piebald Boy of El Cayo, British Honduras. From a photograph most 
kindly provided by M. le Comte Maurice de Périgny. Lisbey is shown with his 
Father. Back view, 1908. Reproduced from Pearson, Nettleship and Usher’s 
Monograph on Albinism, Part IL. 
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respectively, the last being the piebald ; the three youngest are girls aged 4 years, 2 years, and 
9 months respectively. These last three children together with the piebald are given in Plate 
VIII. The grandparents were all “normal,” i.e. not piebald. Plate VIII. suffices to show 
not only the difference in colour between father and mother, but the range of colour in the three 
daughters. 


As soon as I had seen M. Blanchard’s account of this piebald boy I wrote to the District 
Commissioner, at El Cayo, Mr R. H. Franklin, and he most kindly sent me the fuller 
particulars here given, as well as arranged to have the family photographed for me. He tells 
me that the youngster is the pet of the place as well as its “curiosity” and that he is quite 
an intelligent boy. He was born on December Ist, 1904, at Peten in Guatemala and the reason 
his parents give for his piebaldism is “that owing to an eclipse of the moon on the night of his 
birth, he caught it in the head and it scattered over his body.” There was no eclipse of the 
moon on that date nor any near it. 


One point further may be emphasised with regard to this boy. In our Monograph on 
Albinism, p. 248, Plate I (23)—(26), we deal with the case of a piebald boy from Papua and 
we give photographs taken at nine years interval. The dark patches have grown larger, but 
they have not increased in number nor in relative size. In the present case a careful com- 
parison of the dark patches on the legs and arms of this Honduras piebald boy in the recent 
photographs (Plates VIII. and IX.) of July 1912 and those of the Comte de Périgny of some four 
years earlier (Plates X. and XI.), shows with almost absolute certainty that the relative area of 
dark pigmentation has considerably increased. This remarkable fact renders the boy of special 
interest, and it is to be hoped that in still later years photographs of him may be obtainable for 
comparison with the present series. 


IV. Selection and Intermediates in Bacillus coli. 
By LEONARD KEENE HIRSHBERG, M.A., M.D. 


In the course of some other work on a strain of Bacillus coli, taken from the rectum of a 
Scotch collie, and planted first in beef tea on October 5th, 1911, transplantations and agar plates 
were made of these organisms with the original purpose of studying what has been hitherto 
called involution forms. The course of this work directed my attention to the possibility of 
these forms being dependent upon the quantity and quality of the pabulum or nutrient material 
furnished to the bacteria, and hence these so-called involution forms being actually types of 
polymorphism. 


Incidentally observations were made, in view of the claims made by the biologists represented 
by Professor Castle of Harvard on the one side, and Professor H. 8. Jennings on the other, along 
the lines of possibly selecting races of long, short, and various intermediate generations of this 
colon bacillus. If it had proved possible to select a type of long, intermediate, or short bacilli, 
that had remained within the limits of the select mode without reverting back or possessing the 
power to generate all the types, it would have strengthened the work of Professor Castle on such 
higher animals as rats, with their manifold and necessarily complexly interrelated factors. As it 
is, however, after making two hundred and twenty transplantations of colonies of this strain of 
organisms, and trying to generate true long, short, thin, narrow, and intermediate types for both 
length and thickness, I find at the conclusion of that part of the work that there is absolutely 
no ground in my experiments supporting selection as an element in generating a particular type 
of these bacteria. 


Placed in suitable nutrient media, at room temperature, these bacilli divide very rapidly by 
simple fission. From twenty minutes to half an hour is the average time of division, yet if we 
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allow an hour, it is seen that in twenty-four hours—the average time of the transplantations on 
tubes or plates of various media for selecting—one bacillus gives generations numbering seven- 
teen millions of separate individuals. 


When the nutrient material is favourable and incubator temperature is used, the types of the 
bacilli tend to be shorter and actually thinner—although relatively thicker in appearance— 
while the rate of fission is much increased. In solid media there is also a greater tendency to 
the small forms, while in peptone and other liquid and at the same time less favourable material 
the long tenuous and slowly dividing types are more common. 


When bizarre types like those formerly called “degenerated bacilli” such as flask-shaped, 
drum-stick, dumb-bell, Indian club, and tenpin-like variations made their appearance, it was 
always possible to transplant these and obtain polymorphous bacilli of all the previously 
known specimens. 


True enough, it was often difficult at first to start these unfavourably shaped types growing. 
They required close attention, frequent transplantations, and the best media such as milk, 
serum bouillon, and sugar agar or sugar gelatin, but they inevitably came around, and again 
generated every type that had been observed during the experiments. 


Hanging drop slides as well as basic aniline dyes were used in the work, and although there 
was no standard speed by which the motility factor could be studied, an incidental attempt was 
made to select slowly moving from rapidly motile organisms. This too was without success, and 
also depended evidently on food supply, temperature, and other environmental changes. The 
succeeding generations always produced both types and none generated true forms of motility. 


Although some attempt was made to call all types under two micromillimetres short, and all 
over six micromillimetres long, the intermediates were allowed a range of three to five micro- 
millimetres. All above five-tenths of a micromillimetre were thick, while thin ones were one or 
two-tenths, 


Summary and conclusion : From these experiments, which it must be emphasized are incidental 
to some other bacteriological studies, it seems that in the case at least of Bacillus coli, a condition 
of polymorphism exists. 


Efforts at selection in two hundred and twenty-five transplantations of thousands of genera- 
tions each, resulted in absolute failure to obtain any true strain of form or motility. The 
organisms, while subject to great variations about the given mode of the variety according to its 
food and environment, always reverted to the previous mean in subsequent generations. 


All types were under the proper conditions possessed of the power of generating all other 
types, hence selection as a method of generating any of these or any new type brought no 
result. 


CORRIGENDA. 


In Vol. vur. pp. 262—6 in the ‘‘Study of Pygmy Crania” by Miss H. Dorothy Smith, a slip occurs 
which is several times repeated. The crania dealt with are described as of the Third Dynasty. They 
belong to the XXVI—XXX Dynasties. As to the cemetery from which they were taken: see Flinders 
Petrie, Gizeh and Rifeh, p. 29. 


Mr J. I. Craig wishes to state that he has discovered the difference between Professor Myers’ 
measurements of head-length and those used in his own paper on the ‘‘Anthropometry of Modern 
Egyptians” (Biometrika, Vol. vitr. p. 78). Professor Myers measured as usual from the glabella; 
Mr Craig was told that the prisoners’ heads were measured in the usual way (see loc. cit. p. 67, § 4), 
and supposed this to be also from the glabella. But he now finds that the Egyptian criminals are 
measured from the nasion. This explains the large differences between the head-lengths of the criminals 
and soldiers commented on in the Editorial footnote on p. 78. 
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ANOMALIES OF PIGMENTATION AMONG NATIVES 
OF NYASALAND. 


A CONTRIBUTION TO THE STUDY OF ALBINISM*. 


By HUGH STANNUS STANNUS, M.D. Lond., etc. 


I. Classification. 


bf ON page 148 of Part 1 of the Monograph on Albinism by Pearson, Nettleship 


and Usher from the Department of Applied Statistics, University College, London 
(Dulau & Co.), 1911, Professor Karl Pearson remarks, after dealing with some 
cases of albinism reported by myself and embodied in that work, “ It is impossible 
to study Dr Stannus’s data without being impressed with the various grades of 
albinism to be found in the dark races.” It is particularly with the wish of drawing 
attention to this point that I now publish notes on cases of albinism which have 
come under my notice since the appearance of the above monograph. Pearson 
notices that there appears to be a greater variation in the albinotic characters 
among dark-skinned peoples, and seems inclined to attribute it to the fact that 
the various lesser grades of albinism are more readily distinguished among them than 
among fair races. With this I agree, and, as my own cases will show, intermediate 
forms between more or less well-marked types are found in numbers. This is 
a matter which I think has largely escaped the observation of those who have 
published records of cases. I am of opinion that for this reason consideration of 
the observations made by a single individual on a number of cases from a single 
country is more likely to give a correct impression of the subject than a mass of 
material drawn from many sources. 


The difficulty in defining albinism lies in the fact that short of a microscopical 
examination of all the tissues of the body it is impossible to say whether pigment 
is present or not. Dealing with the tissues more especially concerned in albinism, 


* This paper formed part of a thesis presented for the degree of Doctor of Medicine at the University 
of London. [The absence of Dr Stannus in Nyasaland has thrown upon the Editor the responsibility 
of editing these pages for press. Names have not always been spelt uniformly in the type-written 
manuscript, and at various other points the Editor has badly needed Dr Stannus’ help. The Editor 
has ventured to add a few footnotes in square brackets on topics, which he would much preferred to 
have previously discussed with the author. Ep.] 
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that is, those tissues which normally contain sufficient pigment to be evident to 
the unaided eye, it has been found that, even when they appear to contain none 
to the unaided eye, on microscopical examination a certain amount of pigment 
may be present ; such has been demonstrated in the case of the eye and hairs. 


Pearson divides albinotic negroes into the following classes : 
(1) Complete albino. 
(II) The spotted (freckled) albino. 


(III) The blue-eyed white-skinned negro with photophobia and nystagmus 
but not red pupils. 
(IV) The yellow-eyed white-skinned negro. 
(V) Xanthous negro. 
(VI) The partial albino or piebald negro. 

Considerable confusion I think has arisen over the matter of the colour of the 
eye, which in my opinion invalidates some descriptions and also probably the 
above classification. “Eyes said to have the pinkish colour common to albinos,” 
“the eyes are pink,” are expressions commonly found in reading descriptions. 
Again, Sir H. Johnston in referring to Nyasaland writes, “cases of Albinism when 
the hair is yellowish white, the iris of the eye pink and the body skin an un- 
wholesome-looking, reddish white are not common.” When I say that I have 
collected a very large number of albinos in that country and that I have never 
seen a case presenting this characteristic of the iris, I think it may be assumed 
that in this case as in some others a lack of precise observation has led Sir H. 
Johnston to make an incorrect statement. 


The same is, I think, true of a number of the loose statements made by other 
persons. They know that in typical European albinos and in, say, white mice 
the eye is “ pink,” and when they see a native of Africa with white skin and hair 
in whom it is difficult to get a proper view of the eye owing to nystagmus and 
photophobia they assume that the “eye is pink.” 


Dr Turner in describing cases of albinism seen in S. Africa, does not specify 
the colour of the irides and pupil in individual cases but refers only in general 
terms to the peculiar pink colour of the pupil, yet from the context it is evident 
he is speaking of the red reflex obtained through the pupils with the use of 
artificial light, though Pearson does not make this point clear. 


The point I wish to emphasize is that in all cases noted by myself, when seen 
in ordinary daylight the pupil appears black and there is no red colour seen 
through the pupils under these conditions as seen in European albinos. I believe 
this to be true of African albinos in general, though I do not altogether doubt 
the occurrence in rare instances of the other phenomenon. The explanation lies, 
I think, in the fact that absolute albinism of the eye in negroes is very rare and 
that the amount of pigmentation present is sufficient to prevent a large light 
entry and reflexion: that is to say, the number of complete negro albinos who 
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will come under Pearson’s Class (I) is very small, and this type is not found in 
Nyasaland at all events. He may on the other hand be referring to the presence 
of a “red reflex” obtained during ophthalmoscopical examination *, 


When examined with artificial light and ophthalmoscopic mirror, all those of 
my cases approaching complete albinism show a “red reflex,” differing from the 
dull red grey in the normal native in being brighter and devoid of the element 
of grey to a variable extent; and in some the brightness being greater than 
in the ordinary European, simulating that in a European albino. 


The reason of the absence of light reflex (red pupil) in daylight is I believe 
due to the impediment to sufficient light entry caused by pigment on the posterior 
surface of the iris as much as to light absorption by pigment in the choroid. 


In all my cases approaching the condition of complete albinism the irides are 
of a liquid steel blue grey colour, in some associated with a peripupillary ring of 
orange colour. 


This I find corresponds to the descriptions given by observers in other parts of 
Africa whose writings bear the impress of being the result of careful observation. 


I have considered these points at some length as I do not feel that the working 
classification given by Pearson clearly defines itself. 


In Class (III) what does “not red pupils” mean? Does it mean the pupil is 
not red in the sense that the white rat’s is, or does the expression refer to cases in 
which no red reflex can be obtained with artificial light ? 


In which class are to be placed cases which though they give this red reflex, 
have blue irides, in (I) or (III) ? 

The brightness of the red reflex varies very considerably, and I shall therefore 
make out one class which includes all those cases with white skin, white and 
yellow white hair, pale blue grey irides which give with artificial illumination a 
red reflex of any degree as opposed to a grey one. I shall also include in the 
same group as a sub-class those albinos with the above characteristics in whom, 
in addition, there have developed spots of pigmentation, so called Spotted or 
Freckled Albinos; I do this because the condition is not uncommonly found in the 
albinos of this country beyond the age of infancy. 


I think it may be of use to make a group under Class (IV), that is, cases with 
yellow eyes but white skin. 


* [I think Dr Stannus is incorrect at this point, the red reflex from negro albino eyes was noted in a 
number of cases even before the invention of the ophthalmoscope and has frequently been noted since 
by trained observers not using artificial light. Our albino dogs have eyes almost identical for amount 
of pigment with those of the albino negro, and in three cases out of four the observer thinks they 
have dark pupils, on the fourth occasion the red reflex is caught because the incidence of ordinary light 
is suitable. If four or five albino dogs are in a room together and they are then excited, any observer 
sees the reflex at all turns. My classification was, of course, a broad one but I think a serviceable one 
as it roughly graded the albino negro into two classes according to the intensity of ocular pigmentation, 
i.e, by manifest or non-manifest red reflex, observable by the traveller without apparatus. Ep.] 
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Under my own Class (IIT) I shall consider cases of Xanthism as I understand 
that condition, Again I would refer to Sir H. Johnston’s misleading remarks, 
“occasionally there are cases of positive Xanthism or a state of colouration similar 
in a much less degree to albinism—namely that wherein the colour of the skin 
and the iris of the eye is quite a light yellowish brown; this type is very much 
admired by the negroes.” It is the last sentence which shows the incorrectness 
of the deduction, though this of course would not be appreciated by anyone who 
does not know the country. 


The light coloured woman admired by natives has a yellow skin, a rather cold 
colour, with irides of a light brown, all the hair, however, is always black as in the 
average native. People of this colour are more common among certain tribes— 
the Achikunda, those of Senna on the Zambesi river, among the Yaos and the 
ruling families of Momberas Angoni—and I look upon this skin colour as a racial 
characteristic and not as a feature of Xanthism. Whether or not albinism is 
more commonly associated with such peoples I am unable to say, though Pearson 
thinks there is some reason to believe that blondism does exist side by side with 
albinism, occurring in the same districts and, it has been asserted, in the same 
stocks. This was first pointed out by Pritchard and has since been supported by 
Pearson, who quotes cases of men from Nyasaland observed by Turner in South 
Africa; the latter however remarks that these cases had “ pubic hair brown,” “in 
other cases the skin was a light rich red colour, wool on scalp a light yellow, the 
eyebrows and lashes reddish brown”; these cases I should immediately admit as 
cases of Xanthism but not those referred to by Sir H. Johnston, who does not 
mention hair colour and the skin colour of whose cases is yellow rather than reddish 
brown. 


By Xanthism, then, I mean a condition characterised by a skin colour of 
reddish brown, red, or warm brown colour, scalp hair a golden or reddish brown, 
the same being true of the face and body hair, the irides being a light brown or 
hazel *. 


Just as cases of imperfect albinism or partial albinism have to be recognised, 
so there are degrees in Xanthism, as in one of Turner’s cases, but I look upon 
the distinctive points about the affection as the peculiar reddish “warm” colour 
of the skin, generally if not always associated with a similar colour of all or some 
of the hair; the yellow-skinned natives referred to above, though lighter in colour 
than many cases of true Xanthism, very seldom freckle as Xanthous natives do. 


Dr Turner has also observed a number of natives with light hair “or down” 


on the temples, a fact which Pearson seems to think should be taken as significant 
as one of the first signs of a lighter pigmentation. With those downy hairs I am 
perfectly conversant but believe them to be merely an overgrowth of lanugo hairs. 
These fine downy hairs moreover are found in many natives elsewhere on the 

* [The difference between Dr Stannus and myself appears to be that while I speak of a red or 


yellowish red skin, red, brown or even yellowish brown hair and brown to yellow irides as cases of 
Xanthism, he excludes anything beyond golden brown in the hair, or beyond hazel in the irides. Ep.] 
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body; they are always of the lanugo type and generally so small as to pass un- 
noticed but they will be seen to be of a lighter colour, golden to brown, and are 
quite distinct from the curled “woolly” hairs of the body skin. 


There is another type of skin pigmentation for which a class has to be made. 
I refer to cases which with a light brown skin colour, not red and not yellow, and 
light hazel irides, have hair of a dirty yellow colour but with no red tinge in it. 
Some of these cases have photophobia and nystagmus pointing to deficient 
pigment in the eye. Typical cases form a well-marked group though I shall 
later show there are no absolute distinctions to be made between any of the 
groups or classes and that intermediate cases are met with all along the line. 
This last class is characterised by the fact that at birth the skin and hair are 
practically white but that slow darkening has occurred. The same may have 
been true of the irides but native observation upon which I have had to depend 
mostly in these cases has not been sufficiently good to remark this point. 


These cases resemble the remarkable case of Dr Ascherson’s referred to by 
Graves and quoted by Pearson, and are, I think, akin to those described by the 
same author as occurring in Papuans and elsewhere, in which the skin is darker 
than in the first form (ordinary typical albino) and in which nystagmus may or 
may not be present, where freckling is absent or at any rate not conspicuous, 
while the hair is tow coloured. Similar cases have been reported from Malay, 


I have considered above albinism of various degrees but always affecting the 
whole of the skin, hair and eyes, that is to say cases of complete and incomplete 
albinism. It now remains to deal with the condition in which the affection is 
limited in extent, what is termed Partial Albinism. According to the extent of 
the albinotic characters this class is sub-divided by Pearson into Piebalds and 
Spotlings ; the leucosis may be complete or incomplete. 


It is therefore a difficult matter when considering cases to be sure under 
which category any particular one should be placed. Some doubt was entertained 
as to the existence of Piebalds until quite recently, when Seligmann’s Papuan case, 
Gilbert Smith’s English case and a family I have described from Nyasaland 
established the undoubted existence of Piebaldism. Persons showing a much less 
extensive area affected, often a single patch, have been conveniently called 
Spotlings and such are not uncommon in Nyasaland. It is to be noted that the 
area or areas affected tend to be the same in different cases. In’ Piebalds the 
belly and chest are involved and often associated with a flare on the scalp and 
“garter” or “stocking” markings. In Spotlings this arrangement and extent may 
be wanting. The various members of the Nyasaland family will be seen to 
present intermediate forms. This distinction into two groups is warranted by 
the fact that in a number of cases where the affection is inherited the type is 
preserved, 


The resemblance of the Nyasaland Piebald family to Bishop Harman’s family 
is worthy of note, and it has been suggested that such cases of human piebaldism 
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“are intensified representations of a light-bellied quadruped, that we have to deal 
with a case of palaeogenic inheritance,” a subject associated with the name of 
Sir J. Hutchinson. 


It will be found that I have set apart the Spotlings in which the penis only 
is affected as they form a special group owing to their great definition and 
owing to the little attention that has been paid to them. 


I have pointed out, as will be seen later, that the pathognomic feature is the 
very light “pink” colour of the affected areas and their sharp margins. 


My own working classification, therefore, is : 


Complete (1) White skin, straw coloured hair, liquid blue iris with red reflex 
aaa through pupil when examined with artificial light. 

eueties (la) Cases as above showing spotty pigmentation of the skin. 
are ({I) As above but the iris light yellow or hazel, photophobia and 
Albinism. nystagmus not always present. 


Xanthism. (IIT) Reddish brown hair and skin, light brown irides. 
Incomplete (IV) Skin light brown, not red; hair dirty light yellow, not brown ; 
Albinism. ae 
irides hazel. 
(V)  Piebalds. 
Partial oo ee 
(VI) Spotlings. 


Albinism. 
(VII) Spotling of penis. 


II. Lllustrations of Anomalies of Pigmentation. 


I shall now give notes of cases and discuss the group to which each belongs. 
Some cases of leucoderma are then described after which the relation of leuco- 
derma with albinism is discussed and the theories as to pathogeny of the con- 
ditions brought forward. 


The cases of albinism so far recorded from Nyasaland are as follows: 
A. 1. From Kayuni’s, N. Nyasa, by Dr Davey (p. 145 Monograph). 


2. Charlie® of Chikulamayembe, N. Nyasa, by Dr Davey (p. 145 
Monograph). 


3, 4, and 5. Cases mentioned by Mrs Young of Karonga, N. Nyasa 
(p. 145 Monograph). 


* T have seen this case and would add to the description already given that the scalp hair of a light 
straw colour grows down over the forehead to the eyebrows. The whole of the back and limbs are 
covered with semicurved very light straw-coloured hairs. Ivides light grey blue with a light orange. 
coloured pattern. Pupil black in daylight. Teeth irregular, ill-formed, and carious. The pedigree is 
given as Fig. 7 of Piate XXI, II 2 and II 4 were brothers, there are no other known albinotic members 
of the family. 
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B. Cases seen and reported by myself and included in the Monograph on 
Albinism (to which pages and Figures in Pedigree Appendix refer) : 
1. Girl aged 44 years, daughter of Dingo, Chiromo; Fig. 426. 
2. Boy aged 24 years at Zomba; p. 148 ftn. 
3. Chesiwandiri of Maumi, S. Nyasa; Fig. 430, Plate Y (77). 
4. Ng’ombe, 8. Nyasa; p. 147, Fig. 427, Plate Y (80). 
5. Child of Mtila, S. Nyasa; Fig. 431, Plate Y (79). 
mee of Mvera, C. Angoniland; Fig. 428, Plate Z (84). 
7. Kantaye 
8. Old man of Nkoma, C. Angoniland | __ . 
9. Old woman 7 + ii Pees 
10. Young woman of Nkoma, C. Angoniland; Plate Y (78). 
11. Soto of Angoniland ; Fig. 433. 
12. Sikumbiri of Blantyre; Fig. 434. 
In the families of these cases 6 other cases occurred making a total of 18. 
13. The Piebald Family of Florence Bay; p. 250, Plates H (21), 
RR (147)—(150). 


C. The following cases are now reported for the first time: 


1. Alice. 8. Chikoya. 

2. Saiti. 9. Nderemani. 

3. John. 10. Mateuta. 

4. Pingo. 11. Child of Annie. 
5. Chibwana. 12. Moyichandi. 

6. Tom. 13. Sawali. 

7. Ali. 


Case O1. Alice of Chikanji’s Village, Dwangwa, Awemba Tribe. Pedigree, 
Plate XXI, Fig. 11. 


The father and brother, a boy aged 6 years, are of average colour, the mother 
is rather light-skinned, café-au-lait. No relatives are known to be albinotic. 


The albino (Plate XVIII (25)) is a child aged 8 months, she has a pinkish 
tinge to a very white skin, no vestige of pigment visible. Hair as nearly white as 
possible, not woolly on head. Inrides a slaty blue with orange pigment network 
round the pupils. Ophthalmoscopic examination impossible. Red reflex not 
present in ordinary daylight. Lateral nystagmus and photophobia marked. 
There are some blisters and sores on neck and ankles, the result of sunburn; on 
body and limbs fine white downy hairs present; a small umbilical hernia present. 
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This child, who presents complete albinism of the skin and hair, shows the 
peripupillary orange-coloured network, but no trace of brown pigment in the iris. 
At the present time, in the absence of any opportunity for making a microscopical 
examination of irides, showing this condition, I am a little doubtful how such 
cases should be classed. I think it is quite possible that this colour network is 
not due to granular pigment at all, but is caused by diffuse pigment such as is 
found in many specimens of red and golden hair, as will be mentioned later. This 
child is therefore provisionally placed in Class I. 


Case C2. Saiti of Zomba. March, 1910. See Pedigree, Plate XXI, Fig. 1. 
The maternal grandmother is said to have been a light-skinned native (I 2). 


The mother’s skin colour is ight brown; Bwanali (III 6) and William (III 5) 
are dark brown; the first four children are dead and other members of the family 
were said to be of average colour, a dark brown. Saiti (III 7 on Plate XXI), 
aged seven, is practically the same colour as the back of the hand (not sunburnt) of 
a fair European, there are some blisters on the shoulders and arms but no freckles ; 
skin thickened in places. The body and limbs are covered with fine whitish 
yellow hairs. Hair of head a dirty straw colour; eyebrows and lashes thick and 
much curled. Eyes; lateral nystagmus present, day-blindness marked, irides 
bluish-grey, with a faint yellow pigment pattern; pupils black. With ophthal- 
moscopic mirror a red reflex shows through the pupil, but not through the iris. 
Fundus a light pink colour, possibly a little lighter than in the average European. 
Teeth very bad. Abdomen prominent, small umbilical hernia present. 


This boy is a very characteristic case of Class I, but shows the orange 
colouration in iris. 

The frequency with which albinos have body hair and also bad teeth will be 
mentioned later. The red reflex was here obtained and the fundus, though found 
to be even paler than in an average European, still contained enough pigment to 
prevent the choroidal vessels being seen. 


Case C3. John of Mwandanas near Piupiu Hill, Zomba. January 6th, 1911. 
Mother and Father Anguru tribe. Pedigree, Plate XXI, Fig. 5. 


With the exception of the first child, which is said to have been white, all the 
other members of the family are known to have been averagely dark-skinned 
natives. An elder brother (III 3) was seen and found of average colour. 


John (Plate XIX (28)), aged 6 years, a complete albino. Skin colour that of 
a fair-skinned European; in places harsh and dry, smooth, with cracking and 
some blisters from sunburn. On the lower part of the abdomen, especially on the 
outer surfaces in front and on adjacent surfaces of thighs, numerous warts are 
present, mostly small, only slightly raised, and rather flat-topped. Others are 
larger, as is one situated over each clavicle, there are also a few which are filiform 
in character. They appear brownish in colour. The body and limbs are covered 
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with light flaxen hairs semicurved, which, on microscopical examination, were 
found to contain no pigment granules. The hair on the head is very light flaxen 
colour, whisker areas covered with curved hairs. Eyelashes a golden brown colour, 
microscopically, no granular pigment in hairs*. Eyes: vision good in moderate 
light and pupils black, nystagmus present, red reflex present with artificial 
illumination. Irides, peripherally slaty-blue, round the pupil a light brown. 
The tongue has a bifid extremity. Teeth: two upper front incisors large, with 
serrated edges; all the front teeth above are spaced. 


The child presents a fairly typical appearance of Class I save for the partial 
pigmentation of the irides, and on this account it must be considered as an inter- 
mediate between I and II. Iam not sure whether the presence of the warts is of 
any significance; as far as I know, they have not been previously reported as 
specially associated with albinism. Note character of tongue and teeth, 


Case C4. Pingo of Chitalu, Liwonde, seen Zomba 18th August, 1910. Pedigree, 
Plate XXI, Fig. 2. 


The parents, aunts and uncles of this case (II 3) and remoter members of the 
family are said to have been ordinarily dark-skinned natives. A cousin, a child of 
a maternal aunt, now dead, is stated to have been an albino. The eldest (II 1) 
of the family, of which Pingo is the second born, was albino, but is now dead ; one 
other brother and three sisters normally black. Pingo is married to a normally 
dark native woman, who has had three children by him, all living and all dark. 


Pingo (see Plate XIX (26) and (27)) is a man of 25 years of age. The skin 
of that part of the body covered by the loin-cloth is still very white, resembling 
the white skin of a European, and is soft and unaltered. The rest of the body 
shows changes in the skin of varying degree, the skin of legs being less affected 
than elsewhere ; it is rough, harsh, scaly, cracked and creased with sunburns and 
superficial ulcerations; the skin is badly kept and dirty. There is hair all over 
the body and limbs, of a light straw colour; eyelashes, thick but ill-formed, are of 
a rather dark yellow colour. 


Scattered over chest, abdomen, back, arms and neck, and also on the face 
are a number of areas from one third to one and a quarter inches in diameter, 
exhibiting a series of changes, giving the appearance of being of the nature of a 
pigmented atrophy; part passw with the colouration, which commences as a light 
yellow colour, there is apparently an atrophy of skin substance, the final result 
being a black depressed spot, irregular in outline and with a brownish tinge at the 
edges, thrown into relief by the pinkish dirty white colour of the rest of the skin. 
Hair, a dirty straw colour, thick. Irides, a light opaque brown; lateral nystagmus 
present, but little photophobia. Pupils in daylight black. With artificial light, 
a red reflex is obtained, the fundus appearing a lightish red. 


* The microscopical preparations of hairs in all cases referred to were made by Miss E. Y. Thomson, 
of the Biometric Laboratory, with the kind permission of Professor Karl Pearson. 
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When he smiles the subject has a retracted everted upper lip with fold of 
mucous membrane showing beneath it; the upper lip has a central linear 
depression instead of being pointed. 


In reporting the case of Ng’ombe (B 4) it was pointed out that pigment spots 
hardly occurred apart from pitting and that the pitting might be the result of 
ulceration with scarring. In the present case, however, no history of such loss of 
substance could be elicited, yet a similar condition obtained. Both these cases 
are adults, whereas my other cases of spotted albinism are children in whom the 
pigmented areas are not depressed. Will these children develop depressed areas ? 
I think they very possibly will. I believe that localised changes in the skin occur 
and cause the depressed areas, the pigmentation occurring before or at the same 
time, but whereas the pigmentation is visible in childhood the early results of 
depression are not seen. The irides of this man are a light brown, that is to say, 
more pigmented than in Ng’ombe, who only had an orange peri-pupillary ring. 
This case is included in Class II, though the iris colour makes it an intermediate 
type. A portion of skin including in its centre one of the pigmented spots was 
excised from the arm of this case and preserved*. Vertical sections were cut and 
examined microscopically stained and unstained and except for the complete 
absence of pigment granules the Jeucotic skin as a whole appeared in no way to 
depart from the normal. The several layers of the epidermis were not very thick, 
the dermal papillae and processes of epidermis between them being little developed 
but regular. 


The structure of the skin at the site of the pigmented spot (freckle ?) differed, 
however, very considerably. Here the dermal papillae are irregularly prolonged 
towards the surface so that the epidermal processes appear much longer but the 
rest of the epidermis thinner. The condition might be considered to resemble a 
wart which was “in” the skin and not “raised” above the surface. 


Pigment granules were seen to occur in all the layers of the epidermis, most 
numerous in the cells of the downward processes of the epidermis, especially in the 
basal layer of these cells, which envelop the dermal papillae. They were seen to 
be more abundant in the peripheral parts of the cells and most densely packed at 
the deeper poles of the cells. 


The granules were of a brown colour and where massed, appear as a homo- 
geneous black mass. 


In the subjacent dermis some cells were seen to contain pigment granules, 
they were few in number and the number of contained granules was small. 
Passing from the middle of the pigment patch where the aggregation of granules 
was greatest, they diminish in number as the periphery is reached. The dimi- 
nution in the amount of pigment was seen to be gradual to one side of the 
section, the gradation taking place over several millimetres with some isolated 


* See Plate XVII (21) and (22) for microphotographs of albinotic skin and ‘‘ freckle.” 
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masses of cells containing granules beyond. To the other side of the section, the 
transition is accomplished in 0°2 mm. This particular pigmented spot of skin 
was but very slightly depressed below the general skin surface, and though it is 
but a single observation I think the changes described would probably be found in 
all examples of the condition. 


This is I believe the first time a “freckle” from a negro albino has been 
submitted to microscopical examination and it would be unwise to draw conclusions 
from this single observation. 


Case C5. Chibwana of Chikala. July, 1910. Pedigree, Plate XXI, Fig. 4. 


Mother and father and two brothers and a sister all normally dark-skinned. 
The maternal great-grandmother had said there was an albino in the family 
ancestry. 


Chibwana (see Plate X VIII (24)), a boy aged 44 years (1910), has a dry cracked 
skin the same colour as a fair Huropean; pigmented patches are gradually appearing 
on the face, also but less marked on chest and back and less still on the limbs. 
These patches are irregular in outline, black or dark brown in the centre, and a 
lighter brown at the periphery, but they are not depressed. Hair on head reaches 
to eyebrows, there is hair also on the cheeks, back, arms and legs, with little on 
chest, semicurved and white; on microscopical examination, it is found to be 
devoid of granular pigment. Eyelashes long and curled, of slightly yellower tint 
than hair on scalp. Irides, olive green with buff yellow pigment ring round 
periphery ; pupil, not quite circular in outline, appears black in daylight. Lateral 
nystagmus present. On ophthalmoscopic examination a red reflex is obtained 
through the pupil, none through iris; fundus pink as in European. 


I think in this case we see an early stage of the condition found in Pingo, 
which may progress until a condition similar to Ng’ombe is reached. 


Case C6. Tom of Chifira’s Village, Bandawe, W. Nyasa. Pedigree, Plate 
XXI, Fig. 8. 


The mother and father, former dead, were both of average colour. No history 
of albinism in the more remote ancestry. The first three children were albinos 
but were killed at birth according to custom among the Atonga. The fourth 
child was allowed to live owing to more civilised ideas of the chief, but died at the 
age of 15 years. 


Tom, the last born, a boy aged 14 years (1909) is mentioned in the Monograph 
and a photograph by Dr Robertson there reproduced (Plate PP (142)), but the 
history of the case is not given*. The skin of non-exposed parts is a pinkish 
white colour, elsewhere it shows sun-tanning and also a considerable degree of 
spotty pigmentation, the colour being not a brown to black, as seen in other cases, 
but a golden yellow like ordinary freckling in a European. Inrides a light hazel 

* [Given as Fig. 648 in Part IV of the Monograph. Dr Robertson states that there were two 


normal siblings and does not mention the dead albinos. Ep.] 
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brown. Photophobia and lateral nystagmus present. Dull red reflex present with 
artificial light. Vision very fair. No hair on body. Hair very white as opposed 
tothe usual light straw colour; microscopical examination showed it to be without 
pigment. 


The lad, though one of the lightest haired cases I have seen, had hazel irides, 
though, be it noted, as nystagmus and photophobia were marked, he would be 
placed in Class II, but owing to his spots, he is considered in Class Ia. The 
spots of pigmentation present, however, do not approach in colour those on the 
younger boy Chibwana. Tom lived near a mission station, and I think may have 
worn more coverings than the other cases, who wore only a loin-cloth. This 
series of five albinotic siblings, all the children born, is a remarkable example of 
albinism in a family. 


Case C7. Ali of Mwaimba, near Mvera, 8S. Nyasa; Yao; male, aged 16 years. 


Three elder brothers, one younger, and one younger sister, mother and father 
all very dark. No more remote members of the family known to have been 
albinotic. The skin colour of the subject is a light “warm” café-au-lait. He is 
covered with fine golden hairs all over the body; hair of scalp, face, axillae and 
pubes a dirty straw colour. Irides a light reddish brown. Nystagmus present. 
Pupils black. Using an ophthalmoscope in a dark chamber, a red glow through 
pupil is obtained, but none through the iris. The optic papilla appears normal. 
The fundus reflects a red glow, a little lighter than that of a European ; choroidal 
pigment is almost completely absent, allowing the choroidal vascular network to 
be completely visible, the small amount of pigment being collected into small 
deposits lying between the vessels. 


The salient features of this case are a skin and eye colour which would cause 
me to consider it one of Xanthism, the hair also is rather dark for Classes I or II, 
but, on the other hand, the fundus is found to have less pigment in it than most 
other cases. He must therefore be considered to be an intermediate form pre- 
senting affinities with Class II and Class III. He has hairs all over the body, 
which I look upon as a condition very commonly associated with albinism. 


This case I consider an important one in linking up the several types. 


Case C8. Chikoya of Mulowi, W. Nyasa. November, 1909. Pedigree, Plate 
XXI, Fig. 6. 


Chikoya, a man of 25 years of age, is stated to have been, as a child, quite 
white, but has gradually darkened to present colour, of dark sunburn in a 
European ; skin fine, not thickened nor cracked. Fine golden hairs are present on 
the arms, legs and shoulders; there is a light yellow golden moustache and beard ; 
the hair of the scalp is a dark brown colour, as is also the pubic hair, but on 
microscopical examination it is found to contain only diffuse pigment and no 
granules. Hair said not to have darkened with age. Irides a light hazel; no 
photophobia; no nystagmus, vision good, 
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In this man we see a case to be placed in Class IV, a group characterised by 
light brown skin colour (not red), in adult life, with nearly always a history of 
darkening with age; the irides are hazel, the hair, though it is a dark brown 
colour, contains no pigment granules. 


Case C9. Nderemani of Namalwere Village, Fort Maguire. 


This man, aged 25 years, was seen in Zomba in 1911 when returning from 
work on the South African mines. 


Skin a rich “warm” brown colour; hair a rich red brown hue; axillary hair a 
dark brown; there are a few golden coloured hairs on the body and limbs. Ivides, 
dark hazel; no nystagmus; vision good. There is a small umbilical hernia 
present. He says he has darkened with age and more rapidly since going to the 
mines than before. : 


This is a typical case of Xanthism, Class III. The hair, on microscopical 
examination, was found to contain few pigment granules, but a considerable 
amount of diffased pigment. 


Case C10. Mateuta of Chiri, W. Nyasa. 


Father of average colour. Mother states that she used to be “ white” when a 
child; at the present time her skin tint is a light brown; irides hazel; hair black. 


Mateuta, a child aged 2 years; skin colour a very light brown, face of lighter 
tint than body; irides greenish hazel, no nystagmus; hair light brown yellow 
with red glint in it; eyelashes and brows of a lighter shade; no hair on body. 
As an infant, is said to have been nearly “white.” Vision apparently good. 
Mother and child constitute, I think, a short series, showing a progressive pig- 
mentation, of value to the general subject. 


Their histories show them to have possessed at birth a white, or almost white 
skin, which has become pigmented with age; the child’s skin has not yet reached 
the degree of pigmentation of the mother’s. The child’s hair is a light brownish 
yellow, which is becoming darker; whether it will actually become black remains 
to be seen, but I expect it will. The same is true of the irides. The mother’s 
are hazel, the child’s are greenish hazel, strongly suggesting that increase of 
pigmentation is in process of taking place. 


Case Cll. The child of Annie and Annausu of Nkumbira, Nkata Bay. 
Pedigree, Plate XXI, Fig. 12. 


All relatives dark; no other cases of albinism in family. The child (II 12) aged 
8 years, is stated when born to have been almost white and to have darkened 
since; the same is said to be true of the hair. The face is a very light sallow 
earthy colour with dark freckles; skin of body a warmer tint of brown. Hair of 
scalp dark brown. A few fine downy hairs are seen on the body and limbs. Irides 
hazel; no nystagmus ; vision good. 
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This child, aged 8 years, can, I think, be taken with Mateuta and his mother 
and might be placed in the series as intermediate between the mother and her 
child. Together they seem to illustrate simply a delay in normal pigmentation, 
a phenomenon to which I do not think attention has been drawn, but one which 
must be taken into consideration when any attempt is made to theorise on the 
cause of albinism. 


Case C12. Moyichandi of Sangano, Blantyre, Yao. Pedigree, Plate XXI, 
Fig. 3. 

Moyichandi, aged 18 years (II 5), is the brother of a case (B 12) Sikumbiri 
(II 6) previously reported in the Monograph, Fig. 434. 

The skin is a light but “warm” reddish café-au-lait colour distinct from any 
normal native skin colour; irides hazel; no nystagmus. Hair light brown in 
colour, which on microscopical examination was found to contain fairly abundant 
granular pigment and a considerable amount of red brown diffuse pigment. 


Sikumbiri, a year younger than his brother, has a lighter tint of skin, not 
so red; the irides are light hazel and nystagmus is present; hair dirty light 
straw colour. Both have hair on body and limbs of a yellow colour. In these 
two brothers we see an excellent example of the connection between two grades 
of deficient pigmentation; the elder I should class immediately as a case of 
Xanthism, whereas his younger brother must be placed as an intermediate 


between Classes II and IV. See Plate XVIII, Fig. 23. 


Case C13. Sawali of Ntiya Village, Mlumbi, Zomba ; Yao, 1911. Pedigree, 
Plate XXI, Fig. 9. 


Sawali, II 1 (for photograph see Plate XII, 1 and 2) and his younger brother, 
II 2, aged 13 and 5 years respectively, of whom the elder one only was seen, are 
said to resemble one another. Sawali seen in January 1911; hair of scalp from 
light yellow to brown, fairer on neck and at the edges of the hairy scalp; eyebrows 
and lashes black, no hair on body, pubic hair black; no albinotic characters in the 
skin, Le. the skin is of a slightly lighter tint than the average native, but well within 
the limits of normal. Irides medium brown; no nystagmus; vision good. Teeth, 


all the incisors are serrated thus: Wee ) ; they have the appearance of 
having ill-developed enamel. 


Seen again two months later. He had in the meantime cut his hair very 
short, almost shaved, and allowed it then to grow again; the soft less-curled hair 
on the temple is a mouse-brown colour, the newly grown, tightly curled wool of 
the scalp is of a yellow colour, becoming brown near the roots; scalp colour is 
the same as the skin. He states that he was born with a white skin which has 
darkened since. 


Individual hairs were found to vary in colour; some very light golden hairs 
were found microscopically to contain a yellow diffused pigment with very fine 
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granules in fairly large numbers; brown hairs contained brown diffused pigment 
and abundant granules; some light hairs contained comparatively few granules, 
fine and widely scattered. There are some spots of pigment-like freckles of dead 
black colour, a few on neck, chest and back. On ophthalmoscopical examination 
a slight dark grey red reflex is obtained through the pupil; the fundus is a dull 
red, rather lighter than in average native. 


Cases of partial albinism in which the hair only is involved but completely 
must be of rare occurrence, and I know of no case among African natives; such 
a striking figure, if seen, would surely have been recorded. 


Note should be taken of the character of the teeth and the presence of 
body hair. This case should be considered together with the last cases above 
mentioned. 


Ill. Piebalds and Spotlings. 


With the exception of the Florence Bay family referred to earlier in this 
paper no other cases of Piebaldism have been seen. 


The following cases are examples of spotlings that I have come across in 
Nyasaland. In the first group mention is made of white tufts of hair: in the 
second, spotlings of the skin are described. 


(A) Several cases of natives presenting congenital white tufts of hair have 
been seen; they resemble each other almost exactly in that the canities occurred 
in the form of a median-placed flare on the anterior half of the scalp more or less 
triangular in arrangement, base in front. There were generally also a few white 
hairs scattered over the rest of the topmost portion of the scalp. 


One such case also presented right and left helical fistulae, a congenital 
anomaly of no significance in this country, as it occurs in about 47/ of the 
population. 


Salule, of Chitanji Village, Chiradzula, showed two small tufts of white hairs, 
one placed in the median line, the other a little to the right and behind it, at 
the summit of the scalp; the scalp beneath them was a little lighter in colour 
than the surrounding. This man states that he was born like this and that one 
of his brothers, but no other member of the family, is similarly affected. The 
white hairs submitted to microscopical examination were found to be devoid 
of pigment. 


G., the wife of a private in the King’s African Rifles, aged 25, was seen to 
have white hairs scattered over the scalp which were said to have been present 
since birth; she presented no other albinotic characters. 


Chitesima was another example of this condition. Microscopical examination 
showed the white hairs to be devoid of pigment, granular or otherwise. 


H., a girl, aged 6 years (1909), from a village on the Lake shore in West 
Nyasa, presented a curious and interesting condition exemplifying partially 
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deficient pigmentation. The affected areas were the right side of the upper 
part of the face, two small patches on the same side of the neck and a patchy 
condition of the fingers. (See Plate XIII, Fig. 5.) 


The area on the face was sharply limited along the mid-line of the nose and 
forehead. The two patches on the neck were at the anterior margin of the 
trapezius muscle over the mid-point of the clavicle. The mottled condition of 
the fingers rather resembled that seen in Ziemann’s leucoderma. 


The general skin colour was that of cake chocolate. The affected areas 
were the colour of bruised chocolate. In addition to the affection of the skin, 
a segment of the iris about one-sixth of the total iris in extent, with the base 
directed towards the right side, was of a light hazel colour, the remaining five- 
sixths being a dark brown, the same as the iris of the other side. The hairs of the 
scalp, eyebrows and eyelashes were normally black. The condition was noticed 
at birth, according to the mother’s statement, and had not altered. The mother, 
father and brother of the subject were all normal, and no members of the family 
were known to have presented any pigmentary anomaly. 


I, female, aged 5 (1909). Seen at Kondowe; she presented congenital 
anomalies of pigmentation in that there was absence of full colouration of the 
skin; the back of the right hand, with the inner three fingers, the palm, and 
the back of the right arm, the abdomen, groin, thigh and leg of the same side 
were the seat of patchy deficiencies of colour, with spots quite white or pinkish 
white to various shades of brown, resembling somewhat the condition seen in 
leprosy, or the non-leprous leucodermic affection of the skin of the hands 
described by Ziemann. The condition was stated to be congenital and not to 
have altered. No other members of the family were affected. 


Daus, of Chimwai village, Dwangwa River, male, aged 11 years (1909). An 
area of skin on the cheek, as indicated in Fig. 6, Plate XIII, is of pinkish white 
colour resembling that of an albino; the area is smooth and on the same plane 
as the surrounding dark skin. The demarcation is abrupt, with no increase in 
pigmentation of the skin in the neighbourhoud. Over the albinotic patch and 
also more abundantly on the surrounding skin are distributed fine white short 
curved hairs, no other hair being present on the cheeks. A few white hairs are 
seen on the same side of the upper lip, and the middle portion of the eyelashes 
on the upper lid of the same side are white. 


There is an area of skin just in front of the tragus of a lighter hue than the 
rest; of the skin of the face, which is a rather dark brown black colour. Irides 
dark brown. History states that these anomalies of pigmentation were present 
at birth. They have enlarged with growth, but there has been no relative increase 
in size. No other members of the family known to be albinotic. 


The girl “H.” presents a condition of extraordinary interest. In the first 
place, I can put on record a case of congenital incomplete spotling albinism, that 
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is to say, a case with small areas of skin, congenitally deficient in pigmentation, 
but not completely albinotic. Secondly, the case demonstrates, I believe, a pre- 
viously unrecorded condition, a partial and incomplete albinism of the iris; it is 
noteworthy also that the condition only affects one side, in fact the area on the 
face is strictly limited to the one side by the mid-line of the nose. This case 
will be mentioned again later. 


In speaking of the other two cases I am not quite so sure of my ground. 


The girl “I.” presents no definite pink area of skin, but a condition resembling 
very much that described as Ziemann’s leucoderma. It is said to have been 
present at birth, and the similar condition of fingers in the girl “H.” supports 
the probability of this statement being true. The history that the condition is 
not changing is against leucoderma and therefore, on the whole, I am inclined 
to include it as a case of spotting. 


(B) The special group of partial albinos characterised by an affection of the 
glans penis must now be considered. 


My attention having been drawn to the point, the condition was then sought 
for among a number of adult men aged from 20—50, and including circumcised 
and uncircumcised, those with a partly retracted prepuce and those with a long 
covering. More or less complete circumcision is practised among the Mohammedan 
Yaos, but not among other tribes. 


At the outset one must recognise that, just as in Europeans so in natives 
of Africa, there is very considerable variation in colour of the glans penis and 
inner surface of the prepuce, colour conditions which are fairly comparable to 
those of the mucous membranes, Just as in Europeans, variations from a 
delicate pink to the colour of uncooked lean bacon, from almost white to 
a purple red, occur; a similar variability is seen in uncircumcised natives, but 
always in the darker hues, that is, the skin in this situation always contains 
some pigment. 


Dr Stachan, quoted by Pearson, states that the glans penis beneath a 
phimosed prepuce is always red; I am not quite sure whether he wishes to 
infer that the skin is devoid of pigment or not, but certainly the latter condition 
is the truth. 


In the circumcised, the glaus is practically always of the same hue as the 
rest of the penis and scrotum, Le. a darker tint than the general colouring of 


the body. 


In the uncircumcised, as a rule, according to the extent of the natural 
uncovering of the glans so is the degree of pigmentation. 


The portion of glans normally uncovered by a naturally partially retracted 
prepuce is often darkly pigmented, shading off to red as the corona is reached ; 
sometimes the general colour is a dark lean bacon-red, it may be with a purplish 
hue; in others, the general colour of the glans is an opaque leaden white. In 


Biometrika 1x 45 


350 Anomalies of Pigmentation among Natives of Nyasaland 


all these cases, however, all variations in colouration shade into one another, and 
it is evident pigment is present. 


Those cases which I have considered as spotlings show areas of various extent, 
always of a “pink” colour, with sharply defined margins, this being the test of 
a departure from the normal, which I consider includes the variations in colour 
above mentioned. 


The following is an analysis of nearly 400 adult males, between the ages of 
20—50 years, being prisoners and soldiers, and therefore they can be considered 
as taken at random from various tribes. 


379 Adult males examined with reference to colour of glans penis and prepuce. 


(A) 122 Circumcised: All have completely black glans penis except two 
(a and b). 


(a) Some spotty pinkish colour present at the seat of the operation 
cicatrix. (N.B. the circumcision was performed when subject was adult.) 


(b) Chambveka, the general colour of the glans is a dark brown, but there 
is a light brown spot on the dorsum near the meatus. 


(B) 259 Uncircumcised : 


(1) Inner surface of prepuce and the whole of the glans penis dark brown 
or black : : , : : ; : : : : . AD, 


(2) There is a gradation of colour from the corona to the meatus and 
corresponding part of the inner surface of the prepuce from blue red, leaden or 
yellowish white to dark brown or black ; : ; : : > 7 LOO! 


(8) Glans and inner surface of prepuce of a uniform colour, from a leaden 
white toa bacon-red_. : : : : : A ; : pe (05) 


(4) Glans and inner surface of prepuce of about the same colour as in 
a fair European, but at the free edge of the prepuce shading off into dark 
brown or black of the general skin colour : ; 5 : 2 20: 


(5) As in (4), except that the lips of the meatus were dark brown 3. 


In another case there was some deeper pigmentation about the frenum 
shading off into the surrounding colour; the skin colour was a dirty yellow 
brown, rather like that of a half-caste, with some freckling; hair black; irides 
dark brown . : . : : : 3 5 : ; : A 1 


(6) Spotlings . : : : : : : : ‘ tals 


From the above it will at once be seen that among the circumcised one may 
say no spotlings were found; case (a) had some alteration in the neighbourhood 
of a scar and is of no import; (b) had a condition which may be called leuco- 
derma, or may be a congenital condition, and be called albinotic, but no history 
was forthcoming. 
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Colour variations among the uncircumcised, (1—5) came within the limits 
of normal variation as above described. The single case mentioned in (5) I look 
upon as really one of the intermediate cases towards albinism, a view strongly 
supported by the freckling. 

To deal with cases under (6) in detail : 

(a) Lijuni, aged 30, glans and inner surface of prepuce a very bright pink 
(considerably lighter in colour than in an uncircumcised European), bounded by 
a sharp line of demarcation at the junction of the inner and outer surface of the 
prepuce. There are a few white hairs in the mid-line of the beard and at the 
outer corners of the moustaches, but no other albinotic characters. See Plate XIV, 
Fig. 8. 

(6) Erya, an Angoni, as in (a) but with an area of dark brown, involving 
the lips of the meatus, though covered by prepuce, thus differing from cases 
mentioned above (5). 

(c) Matesi, an Atonga (see Plate XV, Fig. 11). 

(d) Jim, ground colour of preputial-glans-skin of a reddish grey, albinotic 
area shown in Plate XV, Fig. 12. Skin coloura dark chocolate; irides dark brown ; 
on the front of the abdomen there is a spotty condition, light and dark brown ; 
there are a few freckles, one each on the right side of the abdomen, left chest 
and _ back. 

(e) Chizizu, Yao (see Plate XV, Fig. 13), nothing else to remark. 

(f) Lief. Dark brown skin and irides (see Plate XV, Fig. 14). 

(g) Chiwia. Besides the larger patches of albinotic skin on the glans, 
there is a spotty pigmentary change about the frenum (see Plate XVI, Fig. 15). 
The skin and irides were a dark brown colour, and there were some scattered 
grey hairs on the scalp. Man, aged 30 years. 


(hk) Chivia, Angoni, skin and irides dark brown (see Plate XVI, Fig. 16). 

(t) Naamon (see Plate XVI, Fig. 17). 

(j) Yohani. There is a patchy light and dark brown pigmentary change 
in the neighbourhood of the frenum. 

(k) Magombo. Red pink ground colour of glans, etc. with leaden blue 
spot, as shown in Plate XVI, Fig. 18, skin colour a dark chocolate. 


(1) Asani like (£), only spot is brown in colour; scrotum and penis black, 
skin colour light brown with large black freckles, one on each leg, and some on 
body and face. 

(m) Mwenengwi. Tightly phimosed, but some spotty pink areas can be 
seen just on the inner surface of the foreskin. 

A further case has since been seen. Binal, Ist K. A. Rifles, Zomba, 1912 


(see Plate XIV, Fig. 9). 
45—2 
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Lijuni [(a) above] was circumcised by me on 5th April 1911. Fourteen 
months later he was again seen. Examination revealed a circular patch of 
pigmentation dark brown in colour involving the lips of the meatus and a 
small area round them, three-eighths of an inch in circumference ; the remainder 
of the glans was of a light pink colour, totally devoid of pigment as at the time 
of circumcision. The prepuce was carefully preserved and transverse sections 
prepared and mounted unstained and stained with haematoxylin and eosin 
and by Van Gieson’s method for microscopical examination. I may say at once 
that, with the exception of the deficiency of pigment granules, the whole 
structure of the albinotic areas of the skin appears normal. In the normally 
pigmented part of the prepuce the basal cells of the Malpighian layer are the 
most deeply pigmented, some of them being so full of dark brown granules as 
to appear almost black and homogeneous; this is most marked in the cells at 
the end of the deep processes of epithelium dipping down into the dermis and 
also in the cells lining the lower parts of the sides of these processes. Those 
cells less deeply placed are less pigmented, but all the cells, even the most 
superficial ones, of the stratum corneum contain some granules. The granules 
tend to be arranged round the periphery of the cells, especially at the distal 
end of the cells, giving a very characteristic appearance. In the subjacent 
dermis a few cells containing pigment granules are seen, small in number and 
not very heavily pigmented. The transition from the normal pigmentation to 
skin absolutely devoid of pigment is seen to occur in from 1 to 2 millimetres, 
that is to say, it is not absolutely sudden but the gradation is rapid. The 
diminution in the number of granules is practically proportional in all layers 
of the epidermis; in the dermis the loss appears possibly to be absolute before 
the same change is complete in the superjacent epidermis, and the cells lining 
the sides of the deep epidermal processes retain a greater number of granules 
than the cells at the more superficial and deeper parts: see Plate XVII (19) 
and (20). 

Prolonged beyond the last cells containmg granular pigment a band of 


yellowish brown diffuse colour is visible affecting a layer of cells two or three 
cells thick at the junction of the stratum corneum and stratum granulosum. 


IV. Notes on Anomalies of Hair Pigmentation. 


Samples of hair were examined with results as follows: 


Simbeleta and Nederi, a type of rather coarse hair, which is not very curly, 
recognised by the natives as turning white early in life. Microscopically white 
hairs were devoid of all pigment, a few brown hairs showed brown diffuse 
pigment, with densely packed granules, the black hairs being normally black 
and completely packed with black granules. 


Mwana Ngala. Aged 24 years, stated to have had yaws, and in consequence 
of this illness the hair at the sides of the head and temples is brownish in 
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colour; microscopically there was a diffuse brown pigmentation and granular 
pigment was fairly abundant. 


Another child showed the same changes under similar conditions. 


Old female inmate of the Central Prison of an age when grey hairs have 
usually appeared ; she was noticed, however, to have some brownish hairs, and 
others of a bright golden colour; the black and brown hairs were found to be 
tightly packed with black granular pigment—the golden hairs contained light 
yellow ditfused pigment, but no granules. 


Kause. An old female lunatic, presented the same hair characteristics, macro- 
scopic and microscopic. 


Three men were noticed with a certain number of red hairs in the moustache, 
comparable to the hairs in the moustache of a “red haired” Kuropean. 


Yotam. Scalp hair black; skin a lightish brown, is said to have darkened 
with age; irides a dark hazel. Microscopically the red hairs contained a certain 
number of fine pigment granules, but also bright red diffuse pigment, black hairs 
absolute black, with granular pigment; some of the hairs were partially red and 
partially black, each part showing the corresponding condition mentioned above. 


Pamandhe. Brown moustache hairs showed diffuse brown pigment and fairly 
plentiful black granules; black hairs and some particoloured hairs as in case 
above. 


Chamveka. Light brown hairs were devoid of pigment granules, there were 
some particoloured hairs, which, with black hairs, answered to the description 
above. 


Bandsman, 1st K. A. R. Hairs from dermoid cyst of forehead. Hairs spindle- 
shaped, tapering at each end; brown diffuse and granular pigment present, but 
not equal to full pigmentation. 


V. Case of Leucoderma. 


I now describe a case which clinically is one of leucoderma. It is the only 
case showing leucodermic areas of any extent that I have met with, from which 
I think I may infer with reason that leucoderma is rare among these natives 
save in its minor manifestations. 


This case corresponds very well with a text-book description of the disease—a 
symmetrical and progressive condition associated with hyperpigmentation. Points 
of interest are that the patient is sure the hyperpigmentation has occurred on 
sites previously depigmented. The hairs on atfected parts of the skin which are 
naturally hairy are black. Black hairs are also seen to occur on leucotic areas on 
non-hairy portions of the skin. 


Ouda, of Chechiwere’s, male, aet.. 15 years, Zomba, November 14th, 1910. 
Disease started 3 years ago; he was seen at that time by my native Hospital 
Assistant who vouches for this date. He has never had yaws. The disease 
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commenced as a very small patch on the right shoulder which has never spread. 
Now spots are seen on the body, face and limbs (Plate XII, Fig. 3). 


He states that a patch appears very quickly, in a few days reaching its 
maximum size and that the extremities are the last to become involved. At 
the present time the middle two-thirds of the red margin of the lower lip are 
affected, there is a patch on the right side of the filtrum of the upper lip, some 
small areas over the left shoulder, three patches the size of a half-penny over the 
upper part of sternum, a few points over the chest and abdomen in front and on 
back; the arms from the elbows downwards are the seat of large patches; the 
wrists, thumbs, dorsum and palm of hands and fingers also show patchy depig- 
mentation; patches on the left knee, the dorsum of the feet and toes, and over the 
ankles are also present. 


These affected areas are of a pink colour as of a fair European skin; they are 
in some places a little crusted. 


There is a lack of signs of slow depigmentation, that is, areas of all shades of 
dark brown to pink are absent. In association with some pink areas there is a 
surrounding hyperpigmentation ; these the boy states were previously depigmented 
and pink. This phenomenon was well seen on the arms, the leucotic areas pre- 
senting black hairs as on unaffected skin. Affected areas over the manubrium 
sterni, though there is no hair on the surrounding normal skin, present hairs 
which are black. No one else in the family affected. 


Under the name of “ Melung” (Beta), Ziemann has described in the Archiv fiir 
Dermatologie und Syphilis, Bd. uxx1v. 8. 163—170, Wien und Leipzig, 1905, a 
skin disease among negroes on the West Coast of Africa “characterised by 
the appearance of round, oval or irregular shaped patches of a bright yellow 
red colour on the skin of the extensor and flexor surfaces of the hands and 
feet.” Here in Nyasaland I have seen a number of cases which correspond 
more or less with the descriptions given by Ziemann, but there are several 
points of difference, and I am inclined to think that that observer has made 
too hard and fast limits for the type of leucoderma which he has described. 
He says, “The predominant colour of the affected areas is of a whitish tinge and 
the general aspect may be compared to that of extensive superficial burns which 
have healed without the formation of granulation tissue. The disease begins at 
the age of from 10—15 years; its course is symmetrical, either attacking the 
hands first and then the feet or the feet first and the hands afterwards, or both at 
the same time. The progress of the disease is extremely chronic.” He further 
notes that never more than four-fifths of the surface of the hands or feet are affected 
with pigmentatrophy. The flexor surfaces are more extensively involved than the 
extensor, the pigment atrophy is practically confined to the hands and feet, only 
further slight extensions occur after 10—20 years and there is gradual loss of 
pigment in the hairs situated in affected regions. Sensibility to touch, tempera- 
ture, and pain and muscle sense are unimpaired; the sweat glands are normal ; 
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the electrical reactions of the affected limbs are normal and there is no evidence 
of disease of the nervous, blood or excretory systems. 


As far as I am aware no mention has been made of the disease occurring 
elsewhere than among West African negroes, I therefore give some details of a few 
cases met with by myself. 


Chisfufu. Male, aged about 36. The headman of a village near Zomba. 
Subject states that the disease commenced on the left side of the chest when he 
was 6 years of age, then both hands and arms were affected followed immediately 
by the legs, and it has been slowly progressive till the present time. When seen 
there was a spotty hyperpigmentation on the front of the chest (see Plate XX, 
Fig. 29) only two areas being actually of a pink hue, the same condition is seen 
on the back; the changes on the hands and feet are typical. The depigmen- 
tation seems to have occurred in circular areas so that on fusion the margin 
assumes a scalloped form. 


X. Female, aged 50. States that her affection only commenced 2 years ago. 
On examination the left hand and wrist are found to present the typical features 
of the disease, and though of only 2 years’ duration it is as marked as in cases of 
20 years’ standing. On the palm of the right hand there are some spots 2” in 
diameter of slight hyperpigmentation, signs of the commencement of the disease 
in this extremity. 


Binti Masua. Female, aged 30, of Mponda, wife of Mwini Dowa, private 
Ist King’s African Rifles. . 


She states that she had yaws when aged 8 and that the leucoderma 
appeared at that time, and has remained of about the same extent since childbood. 
On examination there are seen some black areas on the arms, apparently the 
hyperpigmented scars of old yaws. She has a Dupuytren’s contracture affecting 
both little fingers, the first interphalangeal joints of both hands are thickened, and 
the fingers show a slight degree of ulnar deflection. The affection is less advanced 
in the left hand than the right. In the former the depigmentation only affects 
the palm, while on the right side, the dorsum and fingers are also involved. 
Hyperpigmentation is present as a band round the wrist, along either border of 
each hand across the bases of the phalanges and on the back of the first fingers 
and sides of the middle fingers. 


Incheta. Male, aged 50, inmate of the Central Prison, Zomba, an Angoni of 
Dedza. 


The disease is said to have commenced at the age of 17 on the palmar surfaces 
of the wrists, since when it has very slowly spread till the present time. 


The affected areas include both surfaces of the hands and wrists, and extends 
two inches on to the forearm; the depigmentation is patchy in character; where 
the process has been most intense the skin is of a yellowish pink colour. Circular 
areas are seen showing all grades of depigmentation as evidenced by the different 
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shades of brown present. Round the areas in which the change has taken place 
there is hyperpigmentation more or less marked, in some areas the skin being 
absolutely black. There is a worm-eaten appearance on the palms of the hands 
which the man states is the result of yaws in boyhood. There is also compound 
ganglion affecting both wrists. The photograph (Plate XIV, Fig. 10) illustrates 
rather poorly these features. 


Mundana. Male, aged 40, of Chikumbo, Mlanje, Makua. 


Patient states that the affection of his hands and feet was present at birth, and 
has not altered since. This I think is very doubtful and probably one may infer 
that the disease began in very early childhood, and has not progressed of late 
years. He says he has not suffered from yaws. Both hands and feet are affected, 
the former to a small degree and the feet still less, so that it is hardly noticeable 
in the latter. 


There are only two or three small areas actually pink, for the most part the 
colouration is patchy and of various shades of brown to orange. The process has 
involved the hands and fingers, the wristband area and the dorsum of the foot and 
toes on both sides, 


There is a hyperkeratosis of the palms of the hands and of the skin of the 
extensor surfaces of the interphalangeal joints. The palms of the hands also 
present a worm-eaten appearance with the formation of little pits. There is 
likewise heaping-up of epidermis with cracking at the flexures on the palms. The 
hairs on the extensor surfaces of the fingers have fallen out, leaving little pits. 


Y. Male, aged 40. A Swahili from Zanzibar, Range Capitao, Zomba, The 
hands only are involved, but in a perfectly typical way. The disease is stated to 
have commenced when the subject was a small boy. 


Other cases have been seen but of them no notes have been kept. In discussing 
the relation of these cases to ordinary leucoderma it must be remembered that 
natives are seen with a similar patchy speckled depigmentation process affecting 
the skin of other parts than the hands and feet. Thus I have seen a condition 
along the middle line of the back or in the middle of the chest on several occasions. 
In the case of a girl L, aged 10 years, mentioned above, it was associated with 
similar speckled condition of the fingers, and was said to have been present at 
birth. Chisfufu, as will be seen by a glance at the sketch on Plate XX, not only 
had a typical affection of the hands and feet but the process also involved the 
trunk, all four limbs and the lips. 


While it is true that there is a large number of cases of an “imperfect” 
leucoderma corresponding more or less to the description given by Ziemann, 
I find that there are many others in which the process is wider in its distribution 
and which are therefore connecting links between the type described by Ziemann 
and ordinary leucoderma. Even in such cases of long standing the areas involved 
are of small extent, and it is but seldom that one would see a portion of skin of 
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any size reduced to the pink colour such as is seen in ordinary cases of leucoderma 
in natives. 


It is however difficult to see where to draw limits around any particular type, 
though I must agree with Ziemann in setting apart the hand and foot type as a 
fairly definite variety. I agree with him in thinking that the disease does often 
start during youth, but I am not convinced that it always does so. Again it is 
nearly always symmetrical after it has been existent for some time, but not always 
so at the beginning. I believe that in most cases the hands are affected before 
the feet, and though in the majority of cases the progress is very slow, yet in some 
it may run a more rapid course for a time, and then become stationary. 


I have certainly never seen more than four-fifths of the hands or feet affected. 
The flexor surfaces are more extensively involved than the extensor, but the 
change is by no means so commonly limited to the hands and feet as Ziemann 
makes out. I would say that it is more usual for some two or three inches of the 
fore-arm to be involved as well. I should be inclined to lay more stress on the 
character of the colour changes. Practically in all my cases there was hyper- 
pigmentation to be seen in areas of skin adjoining those in which loss of colour 
had taken place. This was generally most marked as a black band around the 
wrist, but was seen elsewhere. 


The loss of pigment as above mentioned would appear to take place at a 
number of points more or less at the same or at different times, so that round 
areas are produced showing varying degrees of loss of colour which may coalesce 
with the formation of larger areas with scalloped margins. Often in the middle 
of such a depigmented area, one that may have been reduced to whitish-pink 
or yellowish orange colour, small spots of more deeply pigmented skin may be 
seen to remain. 


Ziemann has compared the appearance to the result of extensive superficial 
burns which have healed without the formation of granulation tissue. Such a 
description I do not consider good, as under those circumstances the skin is 
essentially pink and never of the orange yellow or yellow white colour character- 
istic of the affection under consideration, Again, the laying down of pigment 
after burns occurs at points partly separated from one another, which appear very 
dark indeed, and do not resemble the darkly pigmented patches seen in this 
disease. 


Attention might again be drawn to the fact that the history of yaws was 
obtained in several cases and a few show evidence of lesions almost certainly the 
result of framboesia, I refer to the pitted appearance of the hand and hyper- 
keratosis. The loss of the hair from the digits in one case should also be noted as 
possible evidence of a trophic element in the etiology of the affection. 


Below are short notes of cases of melanodermia of congenital origin as far as 
their histories could be elicited. 


Biometrika 1x 46 
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Female, aged 17, Zomba, 1910. An area of absolute black skin is seen on the 
right side of the neck extending over the clavicle. No other affection of pig- 
mentation noted. 


Bimba, male, adult, Zomba, 1910. Patch of black skin on the dorsum of the 
left foot, over the heads of the first and second metatarsal bones. 


Talo, adult, male, Ist K. A. Rifles, Zomba, 1911. A black patch of skin under 
the right nipple. 


Mainana, adult, male. Below right axilla there is an area of skin 4—5 inches 
square in extent of a dead black colour. 


CONCLUSIONS. 


The cases I have cited go to illustrate the following facts : 


1. Albinism is common among the natives of Nyasaland. 

2. It isa family affection and often directly hereditary. 

3. Varying grades of albinism are met with in the same family. 
4. Almost every degree of albinism is met with. 


5. The degree may vary with age. 


6. A secondary laying down of pigment in the albinotic skin in spots is a not 
uncommon feature. 


7. Albinism is very generally associated with an unusual development of the 
lanugo hairs and less commonly with mal-development of the teeth. 


8. Piebalds are comparatively rare. 
9. Spotlings are less rare. 


10. There is a very definite class in which the spotling condition affects the 
penis only. 


11. Widely distributed leucoderma is uncommon. 


12. A localised form resembling that described by Ziemann is common, but 
intermediate cases between this form and a more generalised form are seen. 


13. Some congenital cases of spotlings resemble exactly some cases of 
leucoderma. 


The grades of albinism in their relation to one another may be diagram- 
matically represented ; this arrangement has however no pretence to mathematical 
precision. 


In the same way the relations of the various conditions characterised by partial 
albinotic characters may be roughly shown. 
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Coming to a more general discussion of the subject, among facts which have 
to be taken into consideration the following mentioned by Pearson are of great 
importance. Leucoderma may occur very early in life and become stationary. 
Pearson says “the possibility in Leucoderma of very early development followed 
by a stationary condition, and the fact that babies only gradually develop skin 
pigment so that congenital unpigmented areas may not be at once noticed, render 
it not so easy to distinguish partial albinism from stationary leucoderma in a 
European.” Concentration of pigment in the neighbourhood of leucotic patches 
has been laid stress on as a feature of leucoderma, but it is not invariable. 
Symmetry and progressiveness and the combination of excess and deficiency are 
held by Crocker to be the characteristics of leucoderma, but as Pearson says 
“there can be little doubt that...leucoderma ceases to be progressive in some 
cases and that in such cases...the increased pigmentation at the borders is 
wanting.” Further the symmetry “may be by no means marked” and the 
distribution of patches in undoubtedly congenital cases of leucosis may show a 
considerable degree of symmetry. Evidence of recovery of pigmentation in leuco- 
dermic patches is not wanting. 


Leucoderma has further been shown to be inherited, and a case due to 
Dr Joseph Jones has been published in which inherited leucoderma seems “ to 
have culminated in albinism.” There thus appears to be no criterion by which 
any case can be judged to be one of leucoderma or of partial albinism, and the 
history may yield the only facts of value. For these reasons, any hypothesis to 
explain albinism must explain at the same time the facts about leucoderma as 
they are known. 

46—2 
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In dealing with the pathogeny of the condition I follow largely Pearson in his 
presentation of various theories. The normal formation of skin-pigment is a 
question around which much conflicting opinion has grown. The site of the 
formation of the pigment has been variously supposed to be in cells belonging to 
any or all of the primary layers, endo-, meso- and ecto-derm. 


Special cells have been described as acting as carriers of pigment from one 
layer to another and called in consequence chromatophores (Kélliker, Simon, 
Kerbert, Ehrmann); Ehrmann later considered these cells to be actually the 
producers of the pigment, and calls them chromatoblasts or melanoblasts. 
Meirowski’s summing up of Ehrmann’s views quoted by Pearson is as follows :— 


(1) The production of pigment occurs in special cells, the melanoblasts, which 
are not identical with mesodermal or epidermal cells. 


(2) The melanoblasts are products of the middle germ layer, which in part 
further develop there, grow into the epidermis, and there have an independent 
cell existence. 


(3) The material which is converted into melanotic pigment comes from the 
blood and is haemoglobin. Haemoglobin is converted into melanotic pigment by 
the vital processes of the melanoblasts. 


(4) Extra-cellular creation of melanotic pigment is not yet demonstrated. 
True melanotic pigment only occurs in an extra-cellular form by the break-up 
of pigmented cells. 


(5) The transfer of pigment occurs by aid of protoplasmic flow along proto- 
plasmic threads, which connect the melanoblasts with the epithelial cells. For 
this reason this theory is not an “ Kinschleppungstheorie” but an “ Einstrémungs- 
theorie.” 


(6) Pigment is, at least shortly after its production, a body dissolved in a 
highly fluid colourless substance. 


Unna believes that the pigment granules formed according to Ehrmann’s 
hypothesis pass by way of the lymph stream to the epithelium. 


Cohn holds that free pigment certainly exists outside the cells. 


Schwalbe believes that epidermal and dermal cells can each produce granular 
pigment as a result of their individual activity, drawing upon the same source, 
though independently; he believes that: (i) a pigmentation of the cutis can 
occur with completely unpigmented epidermis and hair, and (ii) a pure epithelial 
pigmentation can occur without trace of pigment in the underlying mesoderm. 
He rejects the chromatophore theory and considers these cells to be transformed 
epithelial cells. 


Kaposi also criticises Ehrmann’s views to show that a haemoglobin origin for 
melanin is untenable. More recently other observers (Meirowsky, etc.) have come 
to the conclusion that epidermis and cutis pigment can arise independently, and 
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the melanoblasts of one and the other are independent structures. The nature of 
melanin has been investigated by a few observers. Hopkins and Cole have isolated 
a substance as a product of tryptic digestion of proteid which may be the mother 
substance of the proteinochromes skatol, pyridine and tyrosine. Tyrosine by the 
action of a ferment can be changed into a substance resembling melanin. 


This ferment tyrosinase has lately been found in a variety of species, plants 
and animals, including newborn rabbits, rats and guinea-pigs. 


The above notes on melanin are gleaned from Pearson’s observations on 
Von Furth’s summary. 


Dyson has recently shown, by means of the bichromate-haematoxylin method 
of Lorrain Smith and Mair, that pigment is always formed as part of a complex 
granule which contains lipoid substance. These granules are present constantly 
in the basal epithelial cells, and are formed there under the influence of the 
nucleus as a part of the normal metabolism of the cell. The pigment is in origin 
a lipochrome, the melanin being the chromatic proteid portion after separation 
from the complex lipoid granule. He found no evidence of pigment formation in 
the cutis, but showed sections which illustrated the passage of lipoid substances 
from the epidermis into the cutis along lymphatic channels. 


It appears therefore that our knowledge on melanin and its production so far 
points to the following conclusions : 


(1) Melanin is not a direct (and probably not an indirect) product of 
haemoglobin. 


(2) It is the product of splitting of a complex lipoid substance possibly as 
the result of a ferment action, akin to the formation of pigment from tyrosin by 
tyrosinase. 


(3) The complex granules containing the lipoid substance are produced as 
the result of the metabolic activity of the individual cells of the epiderm. 


(4) The presence of melanin is responsible for skin pigmentation. 
(5) Melanin in the cutis is probably derived from the epidermis. 


This is as far as knowledge goes at present with regard to melanin. How can 
we explain its more or less complete absence in albinism, etc.? Pearson sums up 
‘when we consider the relative rareness of complete albinism, of the spotted or 
splashed condition and of Xanthism, their relatively frequent coincidence in the 
same stock suggests that these abnormal pigment conditions are not wholly 
independent, and that as a working hypothesis it is reasonable to suppose that 
complete albinism, partial albinism, incomplete albinism and Xanthism, all static 
forms of leucosis, are phases of the same process and are probably linked with 
leucoderma and possibly other forms of dynamic leucosis. By ‘linked’ we suggest 
that they mark the complete, incomplete, local or progressive failure of the same 
metabolic process, which may never start at all, never start in certain areas, or be 
imperfectly started, and again being started may fail to maintain itself; further 
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that every variety of this failure may individually or collectively be associated 
with certain stocks, which may either show hereditary failure of one phase, of 
several, or exceptionally of all phases of pigment metabolism.” 

This hypothesis gives a masterly summing up of the position, warranted, I think, 
if only in the light of my own series of cases. 

If we seek the cause in some inherited defect we might suppose that some 
pigment-controlling determinant by its absence causes failure of natural pigmenta- 
tion. Its perfect or imperfect impotence may be a racial feature of certain forms 
of parental albinism. This impotence may be a racial feature of certain parental 
stocks or produced by some degenerating influence of the somatic on the germ 
cells of the individual parent such as must occur in the case of an inherited 
character arising in the parent as a “sport.” 

This absence of certain determinants does not carry us very far, for the question 
suggests itself: how do such determinants act on the normal body metabolism ? 
It has been suggested that some ferment necessary for pigment formation such as 
tyrosinase is absent. 

As Pearson says, “it must however be remembered, (i) that the absence of the 
ferment in the albino must correspond to some abnormality in the zygote and 
ultimately to some defect in one or both gametes. This defect can hardly be an 
absence of ferment. (ii) The materials for development are supplied in utero by 
the mother, who may be (a) herself pigmented, and (b) produce at the same time 
pigmented and unpigmented twins. It would appear that she is thus able to 
provide the ferment for one and not for the other offspring.” 

One might postulate ferment determinants. Pearson suggests “that the 
ultimate difference between the normally pigmented individual and the albino, 
will be found after all to be one of structure; the failure of the normal metabolic 
process is due to differential structure in the albino. If there be local absence of 
ferment it may possibly be that the structure does not permit of its reaching its 
destination,” and mentions in support of this view that albinos may have a 
considerable amount of diffused pigment in their hair. 

He also says that it would be difficult otherwise to understand how in cases of 
partial albinism the ferment should be confined to some portions of the skin and 
be wanting in others. Facts gleaned from my own observations which seem to 
have a direct bearing on some of these points I may here refer to. I should like 
first to mention some observations on the natural rate of pigmentation in African 
natives. 

Pearson states that a black baby at birth almost resembles a white one in 
colour, and quotes Dr Strachan*. With all deference to Dr Strachan, I most 

* [This does not seem adequately to represent the views of the authors of the Monograph, or of 
Dr Strachan. The reader is referred to pp. 120—124 of that work, where much of the literature on the 
subject is cited. The point involved is not the impossibility of distinguishing between a just born white 


and just born black baby, but to the fact that the black pigmentation except on the genitals and some 
other parts is relatively slightly developed at birth. Ep.] 
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certainly must differ from him and agree with Dr Pooley, also quoted, when he 
says that “in newborn babies (negroes) the skin being thinner, the pink of the 
blood vessels is much more clearly seen, but the skin has an ashy grey colour all 
the same, the black is there all the time”—a phenomenon which is clearly seen 
in the still-born and premature births. I do not besitate to say that a portion of 
skin of the new-born native of this country could be immediately distinguished 
from a portion derived from a new-born European child. To give an example, the 
infant of Private Jumbe, Ist K. A. R., seen immediately after birth, had a skin 
colour of a pinkish white with a dark café-au-lait tint about it, not the dead 
white as seen in other young albino children; hair of head rather straight and a 
dark brown in colour, body and limbs covered with very light fine hair of the 
lanugo type; irides a liquid steel blue. Darkening of the skin occurred rapidly ; 
it was not noticeably light after three weeks, and in three months it was an 
ordinary “dark” coloured baby. 


Specimens of hair from the head and body having the colour as above 
mentioned were taken a few days after birth. The hair from the scalp was 
seen to contain bright yellow diffused pigment, and a fair amount of brown 
granular pigment. The body hairs were very fine and undeveloped, and very 
fine granular pigment was present. 


The pigmentation, then, of the skin (irides and hair) of the native of Central 
Africa is a progressive process; normally pigmentation is quite marked at birth 
and thereafter proceeds rapidly, but in a large number does not reach, so to speak, 
completion till the age of possibly 16 years. This I have noticed with three or 
four of my younger servants who during the six years they have been with me 
have darkened very materially. 


This production of pigment as the result of metabolism in certain cells might 
be likened to the production of the salivary ferments by the salivary glands. The 
cells are there in the skin ready for their work in the foetus just as the cells of the 
salivary glands are, but the latter do not begin to produce their ferments till they 
are needed (for the digestion of certain foodstuffs); in the same way the skin of 
the young foetus is unpigmented, and it is only when pigment is about to be 
needed that it is formed. During the few days after birth there is a call for its 
rapid formation and it is produced. 


The stimulus for this rapid formation is probably light rays or ultra-spectrai 
rays acting indirectly by means of a ferment, though, now that pigmentation is a 
racial characteristic, the tendency for the process to start and go on is represented 
in the germ-plasm of the parents. This inherent tendency to produce pigment is 
borne out by the fact that the latter was present in the hairs removed from a 
dermoid cyst (vide supra). 


That depth of pigmentation is conserved as the result of such a physical cause 
as light I have no doubt, as the following observation will serve to show. I held 
a medical inspection of the Ist Battalion of the King’s African Rifles the day after 
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part of the Battalion had returned from Somaliland. I was surprised to find when 
the Battalion as a whole stripped that the men who had been on foreign service 
were many shades darker than those who had remained in the Protectorate of 
Nyasaland. The Somali men were as black as any native I have ever seen, and by 
comparison the Nyasaland men looked like members of a yellow skinned race. 
The reason for this difference was that in Somaliland all fatigues had been done 
bared to the waist, while the men remaining in Nyasaland had not been without 
regimental clothing (khaki tunic or blue jersey) for years. 


I obtained a similar result by keeping several turns of bandage round the chest 
of a native for eight weeks. 


Again, I would refer to my observations on the skin colour of the penis. One 
may say that normally the ensheathed glans penis is not deeply pigmented; the 
pigmentation of the partially covered glans is proportional to the want of cover. 
The glans of a man who has been circumcised is invariably black. Therefore 
though always pigmented to some degree because skin pigmentation is a racial 
characteristic, the process is only completed as the result of exposure in the 
majority of cases. Special notice must be taken of the case of albinism of the 
penis; wherein, after circumcision at the age of 26 years, the skin of the glans, 
which was absolutely albinotic and would doubtless have remained so all the man’s 
life had he not been subjected to operation, developed a spot of pigmentation a 
year later. How could this case be accounted for on the theory of structural 
defect* ? It is a fact worthy of remark that the penis should be the commonest 
site to find a spotling albinism and that it should occur in close on 4°/, of 379 men 
taken at random; excluding the circumcised, it occurred in nearly 6°/,. 


In a large number of albinos we have seen there is a marked ability to form 
pigment (vide spotted albinos, and those of Class IV, who develop a certain amount 
of pigment in the skin but not in the hair and again those in whom pigmentation 
is only delayed, cases C11, C12 Sawali) which does not appear to point to 
defective structure of the skin, the explanation favoured by Pearson, but would 
rather suggest the action of some internal secretion. The hyperpigmentation of 
Addison’s disease is due to the perverted action of the suprarenal glands which are 
commonly the seat of the destructive lesions in that disease, and it seems quite 
possible that normal pigmentation in negroes is due to a somewhat similar cause ; 
light may act not directly on the skin cells but on some tissue or substance which 
determines the formation of pigment in the skin. 


Many of the characteristics of persons living in tropical countries are akin to 
symptoms occurring in suprarenal insufficiency. 

With regard, however, to the factor of abnormality of skin structure brought 
forward by Pearson, I would again refer to the conditions as seen in the sections of 

* [I would venture to suggest that the very facts that the pigment developed in a spot only in this 
case, and that pigment occurs in localised patches only in piebalds, or in freckles and isolated locks of 


hair in otherwise complete albinos do indicate a differential structure; the local absence of a ferment— 
without a cause for its absence—seems to me an inadequate explanation of leucosis. Eb.] 
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Fig. 7. Fig. 


Albinotic penis not referred to in text. Lijuni. Leucosis of glans and inner surface of 
prepuce after circumcision. See p. 351. 
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Fig. 23. Normal Native, Moyichandi and Sikumbiri from left to right. Grades 
of albinism in Nyasaland Natives. See p. 346. 


Fig. 24. Chibwana, an albino with mother and sister, Fig. 25. Alice, an albino baby, and her mother 
from Chikala. See p, 343. from Chikanji’s village. See p. 339. 
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Fig, 26. Head of Pingo, an albino from Chitalu. See p. 341. 
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Fig. 27. Pingo, an albino from Chitalu. See p. 341. Fig. 


8. John, an albino from Mwandanas, See p. 340. 
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a pigmented spot from an albino, and also to the association of an unusual develop- 
ment of lanugo and of defective teeth with albinism. 


The condition of the teeth suggests enamel deficiency, and is one which I did 
not meet with when examining the teeth of 1500 natives of all ages (not albinos) 
for statistical purposes. 


One of the interesting points which has been brought to light by the investi- 
gations of Pearson and others is that in albinotic hair, while granular pigment is 
absent, diffuse red or yellow pigment is present, and the same is true of some 
kinds of red hair among Europeans. The relation of this diffuse pigment to 
granular pigment has not however been I think generally recognised. From the 
microscopical examination of the various hairs above described I think there can 
be little doubt that the granular pigment is formed from the diffuse pigment by 
some process allied to precipitation (compare hair from new-born infant child 
of Private Jumbe, from Mwana Negala, from Pamandhe, also from Sawali and 
Moyichandi) *. 


Loss of colour may be accompanied by a reversed process (compare hair from 
Kause, an old female lunatic). 


A similar process may take place in the skin, as evidenced by the band of 
diffuse colouration beyond the area of granular pigment found in the section 
of prepuce (Lijuni). 


The plan of the localised leucotic patches may be palaeogenetic as suggested 
by piebalds with white belly markings and flare, or there may be a segmental 
distribution (see case of girl H. in whom the area is limited strictly by the 
mid-line). 


One believes therefore that pigmentation is due to an internal secretion, that 
light and other climatic conditions are determining factors, that a ferment may 
play some part, that the pigmentation of the skin is due to melanin produced in 
the cells of the epidermis, and that therefore structure also must be taken into 
account, that in albinism in its various grades and in dynamic forms of leucosis 
one or other or several of these factors have to be taken into account. 


Addendum. One may say that in ordinary natives the colour of the iris is 
always some shade of brown. There is one exception. The iris in a few otherwise 
ordinarily dark natives may be of a greenish colour with no element of brown, 
a very striking feature, and known as “jeri-jeri” by the natives, who say it 
resembles the eye of a cat. 


* TIt seems difficult to establish such a conclusion until very elaborate chemical investigations of 
the two types of pigment, lipochrome and melanin have been made. The microscopic investigation of 
hundreds of samples of hair, albinotic and non-albinotic, are given in Chapter VIII of the Monograph, 
but they tell so far as much in favour of the independence as of the dependence of the two types of 
pigment. Ep.] 
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ON THE EXPECTATION OF LIFE IN ANCIENT ROME, AND 
IN THE PROVINCES OF HISPANIA AND LUSITANIA, 
AND AFRICA. 


By W. R. MACDONELL, Hon. LL.D. Aberdeen. 


THE data on which the following paper is based have been extracted from the 
Corpus Inscriptionum Latinarum of the Berlin Academy. For Rome I have 
examined Vol. vi. parts 1 to 4 (2), for Hispania and Lusitania Vol. m1. and its 
Supplementum, also the Ephemeris Epigraphica vit. and 1x. 1/3, and for Africa 
Vol. vu and its three Supplementa. 


Rome—(a) The Civil Population. 


I have divided the Roman inscriptions into two classes, (a) those relating to 
the civil population, males and females being shown separately, and (b) those 
relating to the soldiers of the garrison. 


Method of stating Ages. In the great majority of adults the age is given 
in years only, but in a considerable number the months and days are added, in 
a few cases even hours, Children’s ages in the majority of cases are given in 
years, months, days and sometimes hours. Also on the soldiers’ tombstones we 
frequently find their ages recorded to days, four of them even to hours, and 
amongst these we find natives of Thrace, Pannonia, etc., which points to accurate 
registers of births having been kept even in distant parts of the Empire. Some 
of the inscriptions record the ages with extreme exactness, eg. No. 2771, 
80 years 8 hours; No. 17677, plus minus 25 years 2 months 24 days; No. 20670, 
86 years 1 hour. 

This accuracy of statement is no doubt largely due to the requirements of 
astrology, which we know was widely practised in ancient Rome*; possibly also 


* No. 27140, the inscription on the tombstone of a boy, who died at the age of 4 years 13 days, 
contains a reference to an astrologer who had evidently forecasted a long life for the child. The 
deceased is supposed to address the passer-by in half a dozen lines, which conclude thus: 

Non igitur, lector, lachrimes: [de]cepit utrosque 
Maxima mendacis fama mathematici. 
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it is in some degree due to the operation of the ordinance of the Emperor Marcus 
Aurelius, who was the first to introduce an official registration of births in Rome 
and the Provinces for the special purpose of establishing the status and age of the 
people when required*, 


But, as I have said, in the great majority of the inscriptions the age is given 
in years only, and in many cases it was evidently not known accurately, probably 
no record of birth having been kept, or the record having been lost or destroyed. 
An examination of Table I will make this clear. It will be seen that after the 
15th year the frequencies are heaped up on ages expressed by multiples of 5; 
e.g. at 25 years we find 129 deaths of males, but at 24 and 26 only 57 and 
34 deaths respectively. Soldiers’ ages are treated in the same way. Perhaps 
the explanation is that in many of these cases the age was not known exactly 
in years, but that the number of lustra of five years which the deceased had lived 
would be known fairly correctly, and would thus furnish the basis on which the 
number of years was calculated. Of course we cannot say how such a method of 
computation, if it existed, would be worked out in practice; if for instance it 
was known that no long period had elapsed since a person had completed his 
6th lustrum, his age might conceivably be recorded as 30, if a considerable period 
had elapsed, as 35. Whatever the real explanation may be, it is evident that 
the practice causes a difficulty which will be obvious from an inspection of the 
diagrams on Figs. 1—3: the polygons showing the expectation of life in the 
ancient world present a jagged appearance in marked contrast to the smooth, 
continuously descending curves which represent the data for modern times. 
But these data are subjected to smoothing before publication. The difficulty 
might have been got over, as Mr Palin Elderton has suggested to me, by adding 
together the deaths recorded at ages 18, 19, 20, 21 and 22, calling these 20 
(and similarly for the other groups) and then redistributing them. The same 
result, however, can be attained fairly accurately and with a saving of labour, 
by supposing the jagged polygons replaced by continuous curves, and such curves 
can be readily placed on the diagrams. 


Date of the Inscriptions. Few of the inscriptions which record age at death 
give us a direct clue to the dates at which they were erected. Some that were 
found in the “Monumenta” of families and households of historical personages 
(Livia, Nero, Drusus, Marcella, etc.) can be assigned to the early Empire, and 
a few, mainly Christian of the 4th to 6th centuries, give the names of the 
contemporary consuls, and can thus be dated exactly. In one case, No. 9919, 
the name of the Emperor, Mauricius (A.D. 582—602), and the year of his reign 
are given. It is possible that scholars can date the inscriptions from the form of 
the letters, the spelling, etc., but as I am not aware of this having been done, 
I must leave the point unsettled and would suggest with some hesitation that 
the great mass of the inscriptions belongs to the first three or four centuries 
of our era. 

* Marquardt, Das Privatleben der Rimer?, p, 86. 
47—2 
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Status of the Deceased. A considerable number of the inscriptions record the 
status of the deceased, and a somewhat rough examination gives the following 
results for males* : 


Servi a ee an ae 121 
Vernae a - ney bes 164 
Alumni a oe ae ey 65 
Liberti oe aes ae sa 390 
Officiales and Artifices... re, 290 
Agitatores and Histriones as 20 
Patroni re sae a ye 647 
Ordinis equestris ... ue oe 19 
Ordinis senator... ea oe 8 
Sacerdotes, haruspices and calatores 17 
. 1158 


The mere fact that a person was considered worthy of a gravestone and 
inscription at all, would seem to indicate that the deceased did not belong at 
all events to the dregs of the people, and I am therefore inclined to believe 
that we are here dealing with a population which in status was above the 
very poorest and consisted largely of the slaves and freedmen, with their wives 
and families, of the well-to-do classes of Rome. 


Expectation of Infe. A reference to Table I will show how this is calculated. 
By way of illustration I will take Roman males. I start with a population of 
4575, which I call 0,, using the actuarial symbol; of these 74 died before 
completing their first year, therefore 4501, or J,, completed their first year; 
of the 4501, there died before completing their second year 197, therefore 4304, 
or J,, completed their second year, and so on. Then the complete expectation of 
life at any year a is defined as en, 

i! v1 
2 ar ae ere : 


For example, at the year 1 it is 


1 4304+ 4086+...4+1 1 95185 


at 4501 DG apo 


The values of the expectation are shown by the curves on Fig. 1, and for 
purposes of comparison I have added the curves showing the complete expectation 
of life in modern England and Wales, based on the mortality of the general 
population, not of healthy lives only, in the years 1891—1900. These data 
are of course smoothed by the Registrar-General’s officers before publication. 


* When the indices to Vol. vr. of the Corpus are published it will no doubt be possible to extend and 
correct these figures. 

+ It is worth noting that these seem to have been a long-lived class, the average age of the 64 being 
62—63 years, 
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From Figure 1 we see that, contrary to modern experience, the curve for 
females always lies below that for males until about the age of 75; after that age 
the two curves practically coincide. For the first 20 years the male curve remains 
fairly level, while the female curve goes steadily down, until the difference between 
the two reaches a maximum about the 20th year; both remain level for the next 
10 years, and then they gradually approach, the female curve rising until about 45 
and thereafter falling. If our data had been ampler we should no doubt have had 
curves of a steady downward slope, as in the case of the African expectation. 


The most striking feature of the diagram is the very low expectation of life 
in ancient Rome, as compared with that of the present day, up to the age of 40, 
and especially for the first 20 years of life. This might be accounted for to some 
extent if we had reason to think that monuments in memory of persons who died 
before reaching 20 were likely to be erected in relatively greater numbers than in 
the case of older people. But I know of no reason to suppose that this was the 
case; indeed the probability seems to be the other way, for we might reasonably 
assume on general grounds that the older a man was the more likely he would be 
to make his mark in the social group to which he belonged, and be considered 
worthy of a tombstone and inscription. In the absence of definite information 
I think we may conclude that there was a relatively heavy mortality at the earlier 
ages, due probably to circumstances peculiar to Rome, for, as we shall see, the 
expectation of life was much higher in the earlier decades of age in the Provinces 
than in the Capital. 


Between 50 and 60 ancient and modern expectations are very much the same, 
and after 60 the ancients had the advantage. The latter is a remarkable fact, 
but it is just what we should expect if the view that there was a heavy mortality 
amongst the young is correct; by severe selection of the young the survivors 


would be of specially strong physique, and would therefore show a high expectation 
of life. 


Married Women. Out of the total of 3490 females I have noted 897 whom the 
inscriptions specially record as having been wives or mothers, and I find that up 
to the 20th year their expectation is greater than that of all women, the difference 
ranging from 4? at the age of 10 down to 1 at the age of 20; but after the 
25th year it is less, the maximum difference being 2°8. 


In connection with this question, it would have been interesting to know at 
what ages the Roman women married, but unfortunately I have found only 
59 inscriptions from which this information can be ascertained. These 59 record 
both age at death and duration of marriage; the facts are shown in Table II. 
The first and last entries look doubtful, but the others seem reasonable, and would 
point to marriages at 10 to 20 having been quite usual. This custom of early 
marriages might explain to some extent the low expectation of life for women 
at that period, but little weight can be placed on this conclusion, as the data are 
so scanty. . 
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Married Men. For the age of men at marriage the data are still scantier ; 
I have found only 29 cases, and in these the ages at marriage run from 15 to 63, 
11 of them lying between 17 and 20. 


Rome—(b) Soldiers in Garrison. 


The data have been collected from inscriptions 2424—3922 and 32668— 
33038. They record the nationality of many of the soldiers, and it appears that 
they came from distant parts of the Empire—Dacia, Thracia, Pannonia, Egypt, 
Libya and so on, as well as from various parts of Italy itself. Indeed, if we may 
judge from these inscriptions, the great bulk of the garrison was recruited mainly 
in the Danubian provinces and the north of Italy. I have therefore dealt with 
the soldiers separately—there are 607 whose ages are recorded—and find that as 
regards expectation of life they compare unfavourably with Roman males, and 
from 25 to 50 even with Roman married women. But no very satisfactory 
comparison is possible owing to the scarcity of available inscriptions. 


The age of soldiers on entering the army can be found, as the inscriptions 
often record both age at death and length of service. Fortunately a considerable 
number of such inscriptions have come down to us, viz, 400 in Rome, and 
particulars will be found in Table III. From these figures it seems probable 
that the majority of the garrison soldiers joined at ages from 16 to 23, the 
greatest number centring round 19; but remembering the absence of precision 
in stating ages we can only look at the figures broadly and conclude with some 
probability that the hardships and risks of a military life, the change from the 
country to the capital, and perhaps the youthful age of the recruits had an 
important influence on the mortality. 


Ainongst the soldiers were some very old men, 19 of them ranging in age from 
70 to 105; they were no doubt attached to the army in some sort of official 
capacity, apparently after receiving their formal discharge, if we may judge from 
No. 3453, which records the death of a soldier at the age of 70, and states that he 
was “ probatus ” at 22 years, served for 28, and lived 24 years 3 months and 11 days 
after receiving his discharge (“post missionem”). The length of service was 
sometimes very long; three inscriptions record 43, 51 and 55 years. 


Hispania and Lusitania. 


From the Corpus and Ephemeris I have extracted the ages at death of 
1111 males and 885 females. I have not kept a record of the social status of 
this population, but from an inspection of the index in the Swpplementum 
I find remarkably few set down as freedmen or slaves; it may not have been the 
custom to give this information, or we may be dealing with a population of higher 
social status than the Romans. 
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In very few of the inscriptions are the months and days recorded, and the 
same piling up of frequencies at ages represented by multiples of 5 occurs as in 
Rome. 


There is a very marked paucity of deaths in childhood, and a relatively large 
number of deaths in very old age—two at 115 and one at 125 are specially 
noticeable. The result is that the expectation of life in these provinces, both 
for males and females, is far higher than in Rome for the first 30 years of life, the 
difference decreasing in the case of males from 174, and in the case of females 
from 13 at birth to 54 at the age of 30; from 30 to 45 years of age it continues 
to decrease, and in the period 45—75 it is about the same, the small difference 
being in favour of Rome, but after 75 it is again higher. 


Up to 60 years of age the expectation of life of females is less than that of 
males, the difference varying from about 6 to 2; it is then about the same for the 
next 15 years, but from 75 to 95 it exceeds that of the males. From 90 onwards, 
however, no weight can be placed on the figures, as only five females lived beyond 
90. (See Fig. 2.) 

As compared with modern lives the expectation in the case of males is very 
much less up to 40, a little less from 40 to 50, about the same from 50 to 60, and 
from 60 onwards it is higher; in the case of females it is very much less up to 50, 
considerably less from 50 to 60, about the same from 60 to 70, and from 70 onwards 
it is higher. (See Fig. 2.) 


Africa. 


Here, fortunately, the data are more plentiful, there being 6238 males and 
4459 females, and in consequence the polygons of expectation are much smoother, 
but before discussing the latter I have to make a few preliminary remarks which 
raise points of some interest. 


Differences of Reading. As usual the ages are almost always given in Roman 
numerals (rarely in words), and when the stones have been injured or weathered 
or lie in inaccessible positions, naturally the numbers are difficult to read and are 
variously given by different observers. I have usually adopted the latest reading, 
or the one that seems attested by the best authority. 


These discrepancies may arise in various ways; the most common in the 
African inscriptions are the following: 
I and X in excess or defect, of frequent occurrence 
[ for, V, eo.) XT Mor XV 
EO ee Sjoryyueme DDG I ce 1DD.0.4 
I 1p Cee ieee bp. 6 
Vi cree XG nom ar NENG atta KON XG 
b.wt BD.D. See 1p. 


V and L in excess or defect, rare. 
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Sex. There is sometimes a difficulty, chiefly with Punic and other native 
names, in determining the sex, and in consequence I have omitted all doubtful 
cases. 


Size of Families. On one inscription, No. 17463, it is stated that the deceased 
were the father and the mother of 12 children, from which it may perhaps be 
inferred that families of this size were unusual. 


Record of Age. The ages are given in the vast majority of cases in years 
only, but occasionally months and days and even hours are added; 12794 gives 
the age with extreme precision: “anno uno M VIII diebus XX noctu una orabus 
IIII.” The age of Christians seems to have been stated as a rule with greater 
precision than that of non-Christians: PM (i.e. plus minus) is often prefixed 
to the years, and months and days are relatively more frequently added; also the 
date of death in terms of the year of the province is often given on the Christian 
inscriptions. 


Centenarians. It will be observed that I have recorded ages up to 182, but 
I have noted four inscriptions giving the ages as 140, 155, 160 and 170 years, 
which I have rejected as being incredible. It is a question whether the symbol 
for a hundred, viz. c, does not in such cases stand for circiter. In one case, 
11902, where the c seems to have been written as a small letter above the 
base-line of the tens, “xxx, I have taken the age as 30. 8008, erected by a father 
in memory of a son, which I have not included, has va cv, and the editor notes 
“fortasse circiter quinque.” Also in 3934 the editor expands ovr XXXV into 
cir(citer) XXXV. 

In 9106 the age of a female is given as 120 years 5 months and 25 days on 
the authority of Wilmanns; here the interpretation of © as circiter seems to 
be excluded by the completeness of the record and we may accept the age as 
correct. Perhaps I should have taken 120 as the maximum limit of age, but 
the few cases which I have taken above 120 affect the result very slightly. — 


As against the probability of those very advanced ages I may quote from 
11594—the inscription on the tomb of a man who died at the age of 82 years 
7 months—these words: “non digne cito vita caruisti, vivere debueras annis 


fere centu’,” which seem to indicate that 100 was popularly considered the 
maximum limit of life. 


Causes of Death are hardly ever assigned, but these two inscriptions are 
interesting: 9048, “duos una dies et pestis acerba abstulit hos pueros”; 18792, 
“OQ non ut meruit pest: vita functus est.” 9050, “quae vixit sine febribus,” 
would point to the prevalence of fever. 


Several cases of violent death—iuterfectus, gladio percussus, a tauru deceptus, 
jugulatus, ferro petitus—have not been included. 


Social Status. In the great majority of cases the social status of the deceased 
is not given. A considerable number were soldiers of various ranks, but their 
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length of service is seldom recorded; slaves are not uncommon—in two “sepulcreta 
familiae domus Aug. Carthag.,” 331 males and 215 females have their ages recorded, 
and I presume most of these were slaves or freedmen, and slaves are mentioned 
elsewhere, but very few freedmen. There are a few pagan priests and priestesses, 
and a considerable number of Christians appear, their dates extending from the 
3rd to the 6th century A.D. 


Frequencies. There is the usual piling up of frequencies at ages denoted by 
multiples of 5, but the effect on expectation of life is not so marked as in the case 
of Rome and Hispania. 


Expectation of Life. In Africa the conditions were much more favourable to 
longevity than in Rome and Hispania. 


Up to 40 years of age the expectation for males was not more than 38 above 
that for females, and after 40 it was almost exactly the same. As compared with 
Rome, the expectation for both sexes was higher at all ages, the excess ranging 
from 26 for males and 244 for females at birth, to about 5 at the age of 60; as 
compared with Hispania it was higher at all ages up to 90, especially in the case 
of females, the excess ranging from a maximum of about 12, As compared with 
modern English the expectation is lower for the first 30 years of life, ranging 
downwards from about 8 in the case of males and about 11 in the case of females, 
but after 40 it is higher and goes on increasing until the end. 


Conclusions. Even allowing for the scantiness of the data we may draw the 
following broad conclusions : 


1. The evidence of the inscriptions brings out very clearly the extreme 
unhealthiness of ancient Rome, and on this point it agrees with the literary 
evidence*, 


2. Leaving out of consideration ages above 75 we see that, contrary to 
modern experience, the expectation of life of females in Rome and Hispania 
was lower than that of males; in Africa there was little difference between the 
sexes in this respect, but the males had a slight advantage up to 40 years of age. 
This would point to special influences adversely affecting the health of females in 
the Roman empire which no longer operate in the modern world. 


3. The expectation of life in Rome and Hispania, especially in the case of 
females, compares very unfavourably with that of modern England at the early 
ages, less unfavourably in middle age, but in old age it is higher, owing perhaps, 
as I have suggested, to severe selection in early life. 


4. In Africa the expectation of life was remarkably high, especially after 
40 years of age, and may be due in some degree, not only to climate and strict 
selection in childhood, but also to the considerable influx of colonists from Italy which 


* Friedlinder, Sittengeschichte Roms8, 1. 39 ft. 
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took place in the time of Caesar and the early Empire*—for we may reasonably 
suppose that the colonists were a selected class, possessed of vigour and enterprise, 
qualities which would be inherited more or less by their descendants—and also to 
the fact that a large proportion of the population must have been engaged in 
the healthy occupation of agriculture, for we know that Africa rivalled Egypt as 
a source of supply of grain for the markets of ancient Romef. 


Note. After preparing the material for the Roman and Spanish part of this 
inquiry I found from the Berliner Philologische Wochenschrift, Vol. 24, p. 604 
(7 May 1904) that Professor A. de Marchi published a paper in the Atti dell’ 
Accademia di Padova, 19031, on the average duration of human life in antiquity 
based on the inscriptions in Vol. vi. of the Corpus. He seems to have gone over 
the same ground as I have done as regards Rome, and has also dealt with Gallia 
Cisalpina, where he has found a much smaller mortality amongst children than 
in Rome; in this respect it is interesting to find another province agreeing with 
Hispania and Lusitania and Africa. I regret that I have not seen Professor 
de Marchi’s paper; coming from so distinguished a scholar it carries an authority 
to which the present essay cannot lay claim. 


* Even before Caesar’s time, ever since Rome subdued Carthage, a sporadic emigration of Italians 
into North Africa had been going on (J. S. Reid, The Municipalities of the Roman Empire, 
Cambridge, 1913). Professor Reid points out, however, that ‘‘emigration on anything like the 
modern scale, or even on the ancient Greek scale, was unknown to the Romans,’ and with regard to 
Roman Africa he writes: ‘‘The impression that must be produced by a scrutiny of the evidence is that 
but a minute fraction of the population could claim Roman descent.” (See page 316 of his book.) 

+ Mommsen, Rémische Geschichte*, v. 647, 651. 

The reference appears to be incorrect as no such paper could be found in the Atti in a careful 
search of the volumes at the British Museum. 
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TABLE I. Number of Deaths at each Year of Life in Rome and the Provinces. 


| | 


Years Rome Hispania and Lusitauia | Alrica 
completed === a a 
at Death | 3 Q é fe) Ss Q 
| 0 74 46 4 2 32 16 | 
1 197 127 6 6 60 44 
2 218 95 10 | 4 56 41 
3 185 131 10 i) 63 46 
y} 188 Tey 5 6 49 39 
5 202 99 9 13 56 52 
6 176 91 3 6 39 27 
i 142 99 5 5 42 30 
8 132 89 6 3 46 21 
9 107 78 8 13 36 28 
10 91 | 81 8 10 ip} 24 
11 84 | 64 4 if 52 29 
12 93 68 14 16 42, 41 
3 Th) 63 9 4 53 44 
14 72 54 12 5 47 34 
15 92 75 14 18 86 71 
16 84 9] 21 19 46 4] 
7 94 91 1L7/ 15 51 56 
18 152 53) 31 31 8] 50 
19 106 109 16 10 60 34 
20 138 158 | 58 49 142 110 
21 76 | 62 15 13 95 lad 
22 96 ily | 25 23 78 48 
23 68 15) 18 18 69 56 
24 57 70 15 | 7 23 28 
25 129 152 62 60 238 192 
26 3-4 | (Gut 13 21 32 41 
2 5YD) 54 14 22 69 64 
28 64 73 10 18 39 38 
29 31 34. 9 7 21 16 
30 153 L57 60 51 229 196 
3] 16 18 10 10 65 56 
32 38 | 38 10 12 52 34 
33 35 22 9 8 41 48 
34 19 12 7 3 16 8 
85 131 100 35 39 222, 208 
36 23 18 8 3 34 24 
By 28 25 8 8 46 27 
BS 24 16 6 a 24 28 
39) 10 10 5 1 18 16 
40 146 68 55 50 229 202 
Al 14 A 5 6 46 42, 
42 20 11 4 2 39 20 
3 4 af 7 3 30 28 
AA 9 4 4 2 17 9 
4S 64 5] 30 24 167 122 
| AO 9 5 4 = 32 14 
47 9 5 4 3 40 18 
AS 10 10 9 2 20 15 
49 Tl 4 2 = 15 i 
50 69 37 49 51 187 131 | 
51 Ti 4 1 2 72 34 
52 10 3 4 4 3333 23 
| 53 4 3 6 4 48 25 
54 2 3 3 1 15 9 
| DD) QT 16 31 22 168 102 
56 5 5 6 2 27 10 
57 vi 1 5 3 31 y/ 
| 58 12, 1 4 4 21 18 
59 33 1 —— 2 9 5 
| 60 75 34 5383 31 245 194 
O1 4. — 9 4 60 | 35 
| 62 11 3 4 2 28 | 14 
O38 ih — 5 1 29 | 2 
O4 6 4 | 3 — 6 2g 
G5 30 16 | 25 13 | 188 137 
66 3 1 4 1 | 27 14 
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TABLE I.—(continued). 


| Years Rome | Hispania and Lusitania Africa 
| completed ae nla =a i i ae —— ae 
at Death 3 g | 3 Q 3 ie) 
67 3 4 1 2 23 15 
68 4 1 2 aa 20 7 
69 1 = = = 7 5 
70 43 14 48 17 277 179 
va 4 33 6 3 67 42 
(eae 5 3 3 6 27 14 
Sy ti 4 2 3 2 42 15 
V4 2 z 3 = 8 4 
78 16 16 25 14 249 182 
ee 760 | 6 an 1 1 22 ri 
oy ae 4 Tee | 3 a 18 il 
eee 78 4 2 1 = 14 3 
79 1 = = a 2 6 
0) ae Sh) 12 OB hs) 6 264 155 
Sie a ee 1 4 3 62 | 30 
82 3 2 2 1 22.) | is 
83 3 2 2 = 16 12 
84 3 = — = 6 1 
85 13 3 14 2 173 102 
86 3 1 eee || 1 12 7 
oy, 3 2 — | 1 16 9 
88 2 I — | 1 9 7 
89 a =~ = 1 9 1 
90 17 3 5 2 95 58 
ine | 1 l a 33 12 
92 1 1 1 =. 14 5 
93 i Pe 3 = 18 5 
94 2 = Il = 3 = 
95 4 l 3 = 70 43 
96 5 se, 0) ge Be 6 4 
97 2 3 | 3 = 14 9 
98 2 ae oe i 5 1 
99 2 a = =e 6 2 
100 1 1 2 2 46 39 
101 = = 2 ane 19 33 
102 us — = a 6 4 
103 1 Ze = 1 7 10 
LOL — = = = 1 as 
105 2 =e i an 39 20 
106 — l = 3 il 
107 = S|. an 5 l 
108 = = | -— = — 1 
109 = = | = ss il = 
110 a = Peet = 21 8 
111 = = i oo = 3 2 
112 = et Ss a 1 a 
1138 H la = = at i = 
11h = _ = = 1 wa 
115 ie ae l 1 i 9 
116 ies = yu ane se _ 
itty ae —_ =t 2 & l 
118 = we = a = _ 
119 es = = ==. ii = 
120 * ; oo SS ly. oe 9 | 3 
121 aN = = | as l | l 
122 = = 12a, yaaa | es - me 
123 an ae Pe Ste Sy = l 
124 = es | ws | Ee = = 
125 = = 1 = 4 4 
126 = ani i a a <a at 
127 = —_ =a Ss i ss 
128 Sa = a ie eee ee et 
129 we ae i = a =e, 
130 re ms 3 = ne 1 
131 ae ee = aa 1 —_ 
132 a Ess = a aa 1 

Totals 4575 3490 1111 | 885 6238 4459 
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TABLE II. 
Age of Roman Women at date of Marriage. 


1 
Number Number Number 
Age of Age of Age of 
| Women Women Women 
7 1 LG 1 28 1 
10 3 18 4 30 1 
12 ra 19 2 31 1 
0k} 6 20 6 82 1 
14 6 21 2 56" 4 1 
ener: 8 2 1 
| 6 5 25 2 59 
TABLE IIL. 


Age of the Soldiers of the Roman Garrison on Entering 
the Army. 


Number Number Number 

_ Age of Age of Age of 
| Soldiers Soldiers Soldiers 
a = Pe 

iy || 2 2 8 33 1 

16 15 95 12 38 1 

ie 28 26 ii 43 1 

18 72 27 9 45 1 

19 59 28 1 47 2, 

20 91 29 2 52 1 

21 31 30 3 
le BAe 29 31 2 400 
| VAS) 18 be 4 
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PEDIGREE OF THE S. FAMILY WITH HEREDITARY “LOBSTER CLAW.” 
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After this pedigree was engraved the Rector of N. wrote to the Editor stating that he had found at last the entry of IV. S. (II. 2) in the baptismal register, two years later than that of III. 3, so that doubt as to order of birth 
of III. 2 and III. 3, expressed by the wavy line in the diagram, is now removed. 
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ON THE INHERITANCE OF THE DEFORMITY 
KNOWN AS SPLIT-FOOT OR LOBSTER-CLAW. 


(SECOND PAPER.) 
By GEORGE McMULLAN, M.D. anp KARL PEARSON, F.RS. 


In 1908 Pearson discussed the pedigree of a certain Lobster-Claw or Split-Foot 
Family in this Journal*. Pearson’s information was obtained from several 
members of the family, but his photographs and skiagrams were of one particular 
section of it. McMullan, unacquainted with Pearson’s paper, read a paper on the 
same family before the Reading Pathological Society in 1910. He worked 
principally with a different section of the family and his photographs and skia- 
grams did not cover the same individuals as Pearson’s. Hearing of the latter’s 
paper, McMullan placed his material at the disposal of the Editor of this Journal 
and the present supplementary memoir is the result. During the five years that 
have intervened since the first paper several additions have been made by birth to 
the family; thus while Pearson dealt with 25 deformed individuals the present 
account contains 33+. Further, somewhat fuller particulars have been found of 
the two earliest generations. The order of birth of Generation IIT. and further 
the total number of brothers and sisters were given differently by different 
members of the family to Pearson and to McMullan. These discrepancies led 
Pearson to make some investigation into the church registers, but they disclosed 
a rather chaotic state of affairs. Some facts, however, resulted. Ann J. (IL. 3) was 
married to Francis S. (11. 4), at N. in the Chilterns in 1837, both being unable to 
sign their names. It was supposed that III. 2, 3 and 5 were born before the 
S.’s migrated to W. The names of most of the other children in Generation ILI. 
are to be found at about the proper dates in the baptismal register of W. But 
not until the birth of John S. (IL. 13), who must have died under two years of 


* Biometrika, Vol. v1. p. 69, 1908. 

+ Besides six additional births of deformed children since the date of Pearson’s record, IV. 9 has 
recently stated that her parents had two deformed children who died as infants, IV. 4 and IV. 5. The 
statement is now confirmed by IV. 3, 
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age,—for his brother born in 1866 is also John S—do the names Francis and 
Ann S. reappear. The births of William S. (IIL. 2), James S. (III. 3), and 
Elizabeth S. (III. 5) are fixed by the registers of N., but another James S., and an 
Eliza Ann 8., Charles S. (IIL. 7), Emma S. (IIL. 9) and Thomas S. (ILt. 10) are 
attributed in the registers of W. in four cases to Thomas and Frances S. and in 
one case to William and Ann S.! There was thus very great carelessness in the 
preparation of the baptismal register, or our illiterate couple frequently changed 
their names, or there were other families of the same surname and with children 
with the same Christian names in the parish, although there is no other trace 
now of their former existence. Ultimately, therefore, we are again thrown back on 
the statements made by the two survivors of Generation III, who, it may be 
noted, did not give their parents names correctly, or on those of the grand- 
daughter IV. 3, who has, after an interval of five years, given almost the same 
account of Generation III. as she did in 1908, and on both occasions referred to 
Ill. 2, If. 9 and III. 13, who seem substantiated by the registers, although 
they were not referred to by IIT. 7 and III. 14. Probably Generation III. is now 
in fairly correct order, although it is possible that two further children—who 
must have died young—namely, another John (1848) and another Thomas (1852), 
really belonged to this pair, who were married in 1837 and were still having 
children in 1866. Of course it is not possible to ascertain whether such children 
or III. 9 and 13 really were deformed or not, although IV. 3 asserted that III. 9 
was normal, presumably on the authority of her father, III. 3. 


In the following account a pedigree number with P. P. affixed refers to the index 
number of the corresponding individual in Pearson’s Pedigree in the earlier paper. 
The accounts given there are not reproduced unless there are additions to be made 
to them. 


Account of Pedigree. 


J. 1 and [. 2 are now given and given as normal, because of the following 
legend, recently unearthed, which attributes the origin of the deformity to 
“maternal impression.” 


I. 2 the great-great-grandmother of the latest generation being pregnant 
committed a theft; when charged with the offence she denied it and further 
invoked her Creator to let her child when born have no fingers to steal with, 
should she be guilty. In due course the infant was born without fingers or toes 
and the woman confessed to the theft lest a greater evil should befall her. All 
which account is very picturesque but somewhat unconvincing. As far as one can 
gather, the woman was well advanced in pregnancy; and no explanation is 
suggested of the process by which the digits were separated, nor of their ultimate 
fate. Perhaps the appeal is to the miraculous rather than to a maternal 
impression. But the legend is so far of interest that it seems unlikely that the 
mother or father of II. 3 were deformed, otherwise, there would have been no 
necessity to account for the appearance of the deformity in IL. 3. Pearson gave 
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IT. 3 (P. P. I. 2) one brother and one sister (P. P. I. 3 and 4), but was unable to find 
any real evidence as to their condition. McMullan’s informants provide no less 
than three brothers and one sister, II. 1, 5 and 6 and IL. 2 respectively, all stated 
to be normal. But no evidence is forthcoming as to their names, place of origin, 
residence or descendants. Ann J. (II. 3) is reported to have been a servant in 
a farmhouse at N., and the legend suggests that her mother was possibly of the 
same class as some of her more recent descendants, which have included tramps, 
wandering labourers and paupers. It seems almost impossible therefore to trace 
any of this J.-family now; some years ago Pearson made, without result, inquiries 
of residents and medical men in the neighbourhood of N., both as to the family 
and as to the existence of any similar deformity in the district. No stress ought 
to be laid on the numbers in Generation II., they are probably not exact. Of 
Francis S., Ii. 4 (P. P. I. 1), we know nothing beyond his name and the fact that 
he could not write it. There is no evidence whatever to suppose him in any way 
of the same stock as II. 3*, It is clear that IT. 3 could not, in Mendelian language, 
have been either a pure recessive (RR) or a pure dominant (DD). In the former 
case she would have had no affected children, in the latter case all affected children. 
She must be looked upon as an impure dominant (DR). Thus every normal in the 
pedigree must be looked upon as an (RA), and consequently since every mate is 
a normal, all the affected can only be (DR)’s. 

In Generation III., we have at least nine children born to Ann J. (Mrs S.), 
perhaps there may have been one or two more who died young and without 
possibility of any record of deformity surviving. Of these nine, four were affected, 
three are known to have been normal, one is reported normal (III. 9), and the 
nature of the other (III. 13) is certainly unknown. This gives a quite reasonable 
Mendelian 50 °/ 

In Generations IV. and V., where we kuow the individuals much more 
accurately, there are eight matings of affected (DR) with normal (RR). Of these 
that of IV. 9 and 10 contributes nothing as there have been only two early mis- 
carriages, The remaining seven matings have so far produced 41 offspring, of 
whom 28 have been affected, nearly two-thirds. The ratio is thus nearly 2:1 and 
not 1:1. The odds against such an excess as 28 to 13 are about 100 to 1, and 


°° 


they tell very heavily against any simple Mendelian theory applying to this 
deformity. 

Thus far the normal members of the stock when mated with outside normals 
have bred only normals. This is no convincing argument for Mendelian theory, 
because in no case has an intra-stirp normal been mated either with another intra- 
stirp normal or with an abnormal. Both such matings would have to be considered 
before we could assume that an intra-stirp normal is as free from taint as an extra- 
stirp normal. Further the fact that three out of the seven matings have given 
nothing but deformed offspring is most remarkable and almost impossible on any 
purely Mendelian theory. If IL. 3, 1V. 15 and IV. 21 had been pure dominants, 


* TY. 3 distinctly states that there was no consanguinity. 
49—2 
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(DDYs instead of (DR)s, this result would have occurred. While if III. 5, IV. 3, 
IV. 13, 1V. 17 had been (DRYs their 26 offspring would have given 13 affected 
and 13 unaffected as they actually do. It appears fairly clear that III. 3, IV. 15 
and IV. 21 were in some manner far more prepotently affected with the taint than 
the other four. But even if we examine the other four, 1V. 3 and IV. 13 seem 
to be affected in quite different degrees; thus not only on the average, but 
individually, the seven families are very improbable on the Mendelian hypothesis ; 
they suggest a graded intensity of taint in the germ-plasm of the different affected 
members of the stock. But the moment such relative prepotency of the individual 
is admitted, Mendelian segregation in definite ratios falls to the ground. A still 
more difficult point is the definition of what a Mendelian “unit” character is in 
such a case as the present. McMullan’s fuller descriptions completely confirm the 
very great variation in the deformity, which as Pearson pointed out in 1908 could 
affect any number from 10 to 60 bones. But it is not only the bones that are 
affected. The usual type is a claw-like foot or hand, the digit or digits remaining 
being bent round at an angle to the remainder of the hand or foot. This is very 
manifest in the bulk of the skiagrams and photographs of the present and the 
earlier paper. But the case of V. 8, born since the earlier paper, and several 
photographs in this paper show that a single straight digit is a possibility. 


Ann J. II. 3 (P,P. 1.2). Mrs Francis 8S. Married to Francis S. at N., April 15, 1837. 
There is nothing to add to the description of the deformity given in this case by Pearson. 

James S. III. 3 (P. P. ID. 6). Born March 4, 1838. Both hands and feet deformed. 
Pearson’s and McMullan’s informants agree as to feet both presenting Ist and 5th toes only. 
But Pearson’s informant (IV. 3) says there were two fingers on right hand, one finger and 
thumb on left; while McMullan’s informant says there was only the fourth finger on both 
hands. The difference between the right and left hands stated positively by the eldest daughter _ 
must, we think, stand. 

William S. III. 2 (P. P. IL. 2). Born February 10, 1840. Normal. Year of death not 
found. Three normal children TV. 1 a—c (P. P. IIT. 1—3). 


Jane Rk. IIT. 4, a native of C., was quite unrelated to her husband, IIL. 3. 

Elizabeth S., afterwards Mrs Wh. III. 5 (P. P. II. 3), dead. Born April 23, 1842. Both 
MeMullan’s and Pearson’s informants agree that she had normal hands; the former reports both 
feet as having only their 5th toes, the latter as having the Ist and 5th toes. 


Wh. III. 6(P. P. IL. 4). Husband of ITI. 5; he was unrelated to his wife, but is now 


reported to have had a deformed foot, quite different from those of his wife, and probably to 
judge by the description a club-foot. 


George S. III. 12 (P. P. II. 10) (dead). Date and locus of birth? no information beyond that 
already given by Pearson that both hands and feet were deformed. 


John S. III. 14 (P. P. IL. 3), seen by Pearson’s field worker and again by McMullan *. 
Born, 1866. Scavenger, unmarried ; both hands and feet affected. 


Feet. Examination refused, but said he had the 5th toe only present on each. 


* Bones present and those absent in this as in other cases made out by palpation. Such a method 
leaves something to be desired, but many of the affected persons feel their affliction keenly and are not 


willing to have it further investigated even if X-ray apparatus were available, which it is not in country 
districts. 
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Hands. Carpus apparently normal on both right and left; metacarpals: Ist, 3rd, 4th and 

5th present on right hand, 2nd absent or rudimentary; 3rd, 4th and 5th present on left hand ; 

4th finger three phalanges present on both hands. Thumb on right only, rudimentary, one 
phalanx. 


S. Mrs R., 1V.3(P. P. TIL. 21). MeMullan’s informant gave no additional information, 
and the skiagrams in the first paper fully describe the case. 


IV. 4 and 5. Two children, sex unstated, reported recently by IV. 9, said to have died as 
infants and to have been deformed. 


George S. IV. 6(P. P. III. 22). Dead, unmarried. Hands and feet deformed ; deformities 
irregular, described by Pearson’s informant ; they involve polydactyly : see doe. cit. 


Richard S. 1V.7(P. P. III. 23). Inmate of workhouse, unmarried. Hands and feet deformed. 


Hands Right Left 
Carpus. Trapezium probably absent. Normal. 
Metacarpals. 2nd, 3rd, 4th and 5th present. Ist, 2nd, 3rd, 4th and 5th present, 1st 
rudimentary. 
Phalanges. Mid-finger, one rudimentary phalanx. ‘First’ finger, one rudimentary pha- 
4th finger, three phalanges. lanx. Ring finger, one fairly well- 


developed phalanx. Little finger, 
three phalanges. 

Feet. Both show the same deformity. 

Tarsus. Probably middle and external cuneiform bones absent. 


Metatarsals. 1st and 5th only present. 


Phalanges. 1st toe, two fused phalanges. 5th toe, three fused phalanges (i.e. no moveable 
joints are demonstrable). A web of skin is present between the two digits. 


For photographs of hands and feet: see Plate XX VII, Figs. (xv) and (xvi). 


Francis S. IV. 8(P. P. IIT. 26), Farm labourer. Hunds said to have one finger only, the 
4th, present. 


Feet said to resemble last case. 


Annie Eliza S. (Mrs F.). IV. 9(P. P: III. 24). Married, no living children; has had two 
abortions at the 4th month (V. 11—12). Has both hands and feet deformed. 


Hands Right Left 
Carpus. Normal. Normal. 
Metacarpals. Normal. Normal. 
Digits. Thumb, one very rudimentary bone. Thumb, short otherwise normal. 
Index finger absent. lst finger, one small rudimentary 
Mid and ring fingers united, one nail phalanx. 


only present, and bones partially Mid finger, very long, three phalanges. 
fused to each finger, three phalanges. — Ring finger dichotomous at 2nd joint. 
4th finger normal. 4th finger normal. 


Feet. Both show a similar deformity. Presence of middle and external cuneiform bones 
doubtful. Ist and 5th metatarsals only present. Phalanges, normal in number, but ankylosed. 

For photograph of hands: see Plate XXV, Fig. (vi); and for that of feet : see Plate XX VII, 
Fig. (xviii). 
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Herbert S. IV. 11 (P. P. UI. 27). Farm labourer, unmarried. Hands and feet deformed. 
Hands Right Left 

Carpus. Apparently normal. Apparently normal. 

Metacarpals. Ist, 3rd, 4th and 5th present. 1st, 2nd (probably present but small 
and rudimentary), 3rd, 4th and 
5th fairly well developed. 

Digits. Thumb, rudimentary, one small pha- | Thumb rudimentary, one small bone. 


lanx. 
Ring and 5th fingers united by thick 
fleshy web. Three phalanges to 


Middle and ring fingers fused, one 
nail. Three phalanges to each. 
5th finger normal. 


each. ; 

Feet. Show a deformity similar to Eliza S. (IV. 9). 

For photographs of hands and feet: see Plate XXVI, Figs. (ix) and (x). 

Wm. Wh. IV. 12 (P. P. IIL. 4). Dead, unmarried. Hands and feet said to have been 
deformed. Said to have had only the little finger and 5th toe present. 

Alfred Wh. IV, 13(P. P. III.5). Married. (Four normal children, V. 12—15.) Hands and 
feet said to be deformed. Hands said to be broad with 5th digit present on each and a thumb 
on right hand. Feet to have 5th toe only present. 

Chas. Wh. IV. 15 (P. P. III. 7). Married, whereabouts unknown (has three deformed 
children). Hands and feet in this case said to be affected ; in what manner not specified. 

Eliza Wh. (Mrs Sm.). IV.17(P. P. III. 9). Recently an inmate of Claybury Asylum suffering 
from confusional insanity. Discharged recovered. Hands and feet deformed: has had three 
children, two male deformed and one female normal. One of the male children and the female 
child are dead. Stated when in asylum that she had had a fourth child with deformed hands 
and feet when sixteen years old, but statement doubtful. 


Hands Right Left 
Carpus. Normal. All the bones appear to be present 
though the trapezium and trape- 
zoid are not well developed. 
Metacarpals. 1st very rudimentary. 3rd present, rudimentary. 
2nd badly developed. 4th and 5th well developed but show- 
3rd, 4th and 5th normal. ing a measure of deformity. 
Digits. One, the 5th, alone present, shows A small bone lies between the heads 
ankylosis. of the 4th and 5th metacarpals, 
possibly a rudimentary digit. 
The 5th digit present and_ badly 
developed. 
Feet, Tarsus. In the case of the feet the deformity is difficult to determine, both however 


show either absence or very imperfect development of the middle and external cuneiform bones. 

Metatarsals. The 1st and 5th alone are present in the left foot while the right shows these 
two bones, and also a rudimentary 4th metatarsal. 

Digits. 1st and 5th alone present, the number of bones in each is normal, but much deformity 
is shown, i.e. ankylosis, dislocations, etc. 

Skiagraphs, hands and feet, Plates XXIII—XXIV; photograph, Plate XXV, Fig. (v). 

Our best thanks are due to Dr Robert Jones of Claybury Asylum for acquainting us with the 
facts regarding this case, and providing a photograph and skiagraphs of the deformities. 

G. Wh. IV. 19 (P. P. ILI. 11). Normal, married, with eight normal children, V, 22—29 
(P. P. IV. 8—14, and one additional birth). 
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Thomas Wh. IV. 21(P. P. IIT. 16). Green keeper on golf course. Married, four children 
showing deformity. 


— 
— 


a 
Right and Left. 


Hands. Carpus. Trapezium presence doubtful. Trapezoid probably not fully deve- 
loped on left hand. 


Metacarpals. Three inner bones alone present. 


Digits. 4th finger only present, very much curved, three phalanges. 


Feet. Tarsus. Presence of middle and external cuneiforms doubtful. 

Metatarsals. 5th alone present. A small hard mass is present lying over internal 
cuneiform bone, possibly a rudimentary 1st metatarsal. 

Digits. 5th alone present, much curved, three phalanges. 

For photographs of hands and feet: see Plate XXVII, Figs. (xiii) and (xiv). 

Annie Wh. IV. 23 (P. P. IID. 13). Normal. 

Henry Wh. IV, 24 (P. P. III. 16). Dead, normal; three normal children, V. 34—36. 

Clara Wh. IV. 26 (P. P. III. 18). Dead, normal; one normal child, V. 37. 

John Wh, IV. 28(P. P. III. 19). Normal. 

We now come to Mrs F.’s family : 

V. land 2 (P. P. IV. 17, 18). Two normal sons. 

V. 9, born since the first paper was written, also a normal boy; V. 10, ditto, but stillborn. 


The remainder of the family are deformed. All have the little finger only on the hands, but 
as Pearson’s Plate IX and skiagrams in the earlier paper show, this may mean very different 
bones missing even in the two hands of the same individual. 

Veto — Papa IVElOSs Ve Abn Ba Ve 205) Viob=Ps PanVo21- \Ve6—BP: PB. IVa 22. Vo i= iP2P: 
TV. 23. Accounts will be found in Pearson’s paper, detailed in the cases of V. 5 and V. 7. 

V. 8=P. P. IV. 24, the “expected birth” of the first paper recorded in a footnote (p. 74) 
as a girl deformed in hands and feet. She differs from the other members of this branch 
of the family in having a single (little) finger straight on the left hand; but this single finger is 
crooked on the right; she has two toes (big and little) on each foot, both “clawed” round, being 
thus like her mother, and differing from her sister, V. 3. 


Eliza Wh. V.16(P. P. IV. 4). Hands and feet deformed. 


Hands Right Left 
Carpus. Scaphoid presence doubtful. Trape- Scaphoid and trapezium probably 
zium probably absent. absent. 
Metacarpals. 2nd, 3rd, 4th and Sth present. 2nd (rudimentary), 3rd, 4th and 5th. 
Digits. Ring and little fingers closely united Ring and little fingers closely united 
by fleshy web, double nail. Three by a thick fleshy web, double nail. 
bones in each. Three phalanges in each finger. 


Rudimentary middle finger, one 
small phalanx. 
Feet. Same deformity in each foot. 
Tarsus. Presence of the middle and 1st cuneiform bones doubtful. 
Metatarsals. 1st and 5th only present. 
Digits. Ist and 5th alone present, much curved, resembling a crab’s claws, normal 
number of bones in each digit, but joints are ankylosed. 
Fred. W. V.17(P. Pb. 1V. 5). Hands and feet deformed. The deformity of the hands is 
difficult to determine (see diagram), but probably the rigid bar of bone felt running along the free 
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border of the hand is due to a displaced phalanx, or to two phalanges fused and displaced*. The 
hands as in the majority of cases show poor muscular development ; the five metacarpals are 
palpable on the left hand. 


Rigid bars of bone. 


Hands Right Left 
Carpus. Apparently normal. Normal. 
Metacarpals. Ist absent, 2nd rudimentary, 3rd Normal but the middle three are de- 
rudimentary, but larger. flected towards the thumb, 


4th and 5th of normal length. 
Phalanges. In both hands a bone is felt lying between the heads of 4th and 5th meta- 
carpals, probably a phalanx of ring finger : see diagram above, 
5th digit shows three phalanges. 1st digit rudimentary. 
5th digit, three phalanges. 

The feet show the same deformity as those of Eliza Wh., his sister. 

For photograph of hands: see Plate XXVII, Fig. (xvii), to right. 

Henry Wh. V.18(P. P. IV. 6). Hands and feet defective. Hands have one digit only on 
each ; feet resemble those of his sister, Hliza Wh., V. 16. 

Hands, symmetrical. 

Carpus. Trapezium and trapezoid bones probably absent in both hands. 
Metacarpals. 3rd metacarpal probably present but very imperfectly developed. 
4th metacarpal better developed. 5th metacarpal normal. 
Digits. 5th only present on each hand, much curved, and joints partially anky- 
losed. 

For photograph of hands: see Plate XXVII, Fig. (xvii), to left. 

William Wh. or Sm. V.19(P. P. TV. 7). Dead. Hands said to have one finger only present- 
Feet normal. 

Alfred Wh. or Sm. V. 21 (P. P. unborn). At present in one of Dr Barnardo’s homes. One 
finger only on each hand, the 4th, well developed. eet. The outer digits alone present, though 
feet appear well developed save for the absence of the toes. 

We are indebted to Dr C. T. Ewart of Claybury Asylum for the facts and photograph of 
this case. 

For photograph: see Plate XXV, Fig. (vii). 

William Wh. V.30(P. P. 1V. 15). Hands and feet deformed. 

Hands. Right and left are symmetrical. 


Carpus. Trapezium probably absent. 

Metacarpals. | The inner three :lone present. 

Digits. Little finger only present and has three phalanges with freely moveable 
joints. 


* This case is of peculiar interest as showing in the S.-family the occurrence of the same ‘ bridge”’ 
type or ‘‘ cross-bone” type that Lewis and Embleton have illustrated by skiagrams in the G.-family ; see 
Biometrika, Vol, v1. p. 41 and plates. 
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Feet. Also symmetrical. 
Tarsus. The external and middle cuneiform bones are probably absent. 
Metatarsals. 5th alone present. 
Digits. A small and very much curved little toe only is present ; it probably has 

the normal number of bones in it. 
For photograph of hands and feet: see Plate XX VI, Fig. (xi). 
Thos. W. Wh. V.31(P. P., then unborn). Hands resemble those of his brother (V. 30) in 
every respect: but have more marked grooves between metacarpal heads. 


Feet Right Left 
Tarsus. The presence of the middle and ex- = Apparently normal. 
ternal cuneiform bones is doubtful. 
Metatarsals. 5th alone present. Probably a rudimentary Ist. 
Ist present, rudimentary. 5th present. 
Digits. 5th alone present and curved. 5th normal in number of bones, but 
Small mass of fibrous tissue overlies curved. 


the rudimentary Ist metatarsal. 


This child is only five years of age and it is difficult to say with any degree of certainty which 
bones are present and which absent in the case of the carpus, tarsus and the metacarpal and 
metatarsal bones. This is true also of the following case. 

For photograph of hands and part of feet: see Plate XXVI, Fig. (xii). 

Rk. Wh. V. 32 (P. P., then unborn). Hands and feet deformed. 


The hands are symmetrical and probably have the trapezium, at least, absent. Either the 
inner two or three metacarpal bones are present, but are badly developed, except the innermost. 
One digit, the little finger, alone is present, and has the usual three bones ; it is very much 
curved. 


The feet are asymmetrical. 


Right Left 
Tarsus. Possibly normal, though the cuneiform bones are not palpable. 
Metatarsals. 1st rudimentary, 5th normal. lst and 5th present. 
Phalanges. Three bones in 5th digit. The first toe is long, and apparently 


composed of two bones. 

The little toe is as long as a child’s 
middle finger, perfectly straight 
and without a nail. Three bones 
present. 


For photograph of hands and feet: see Plate XXV, Fig. (viii). The photo does not give a 
proper idea of left foot which looks more as in diagram. 


— Wh. V.33(P.P.,then unborn). An infant, three months old. Hands and feet deformed, 
the deformity of the hands resembling those in V. 32. 


The feet both show a well-developed little toe, and a mal-developed great toe. 
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The additional information now obtained is of much interest. We can assert : 


(i) That our increased knowledge has not shown the deformity to give any 
closer approach to Mendelian inheritance—on the contrary the disproportionate 
number of abnormals has increased. 


(ii) That the experience of the last five years further emphasises the 
importance of legislative interference with the propagation of deformity. 


Pearson in 1908 laid stress on the wide variation found in the types of hand 
and foot, and remarked that all Lewis and Embleton’s types from the G.-family 
with the exception of the “cross-bones” had been found in the S.-family ; but he 
suggested that this might also be revealed if the whole family were skiagraphed. 
McMullan has now shown that this type actually exists. 


A careful comparison of the present pedigree with the former one will show 
that, if additional births be put on one side, the chief changes are in the orders of 
birth in the earlier generations. We cannot lay much stress even now on these 
orders. In the class with which we have here to deal, the members frequently 
seem uncertain of their own ages—and are therefore ignorant of their brothers’ 
and sisters’ ages and their order of birth. When we find that they do not know 
their own mother’s Christian name correctly, we need not be surprised that they 
contradict each other’s statements as to the relative ages or even as to the numbers 
of their brothers and sisters. This especially will be the case in people of this 
class when the youngest is born 25 to 28 years later than the eldest. Unfortunately 
the Church baptismal registers appear equally chaotic; they make no pretence to 
identify father and mother by the same names in two successive years! On the 
whole there is a reasonable agreement in the two accounts. Those who have 
studied such families as those in the G.* and S.-pedigrees of Lobster-claw, must 
earnestly wish for some immediate step to be taken to restrict the propagation 
of the physically defective; it is no less urgent than the restriction of the 
propagation of the mentally defective. 


* We have further evidence that in the G.-family there has also been a considerable number 
of births of deformed individuals in the last five years, and a similar experience has arisen in the 
case of a number of pedigrees of hereditary deformity recently brought up to date by the Eugenics 
Laboratory staff. 
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Fig. (i), 2. Sm. Pedigree IV. 17. Right Foot. (See Plate XXV, Vig. (v).) 


. Sm. Pedigree IV. 17. Left Foot. 
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Tig. (ii). L. Sm. Pedigree IV. 17. Right Hand. (See Plate XXV, Fig. (v).) 


Fig. (iv). H. Sm. Pedigree 1V. 17. Left Hand. 
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Fig. (vy). 2H. Sm. Pedigree IV. 17. Hands and Feet. ig. (vi). dA, W. FF. Pedigree IV.9. Hands. (See Plate XXVII, 
(See Plates XXIJI—XXIV.) Fig. (xviii).) 
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Fig. (vii). dA. Wh. Pedigree V. 21. Right Hand, and Feet. Fig. (viii). R. Wh. Pedigree V. 32. Hands and Feet, 
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Fig. (ix). H. S. Pedigree IV. 11. Hands. Fig. (x). H.S. Pedigree IV. 11. Feet. 


Fig. (xi). JW. Wh. Pedigree V. 30. Hands and Feet. Fig. (xii). 7. IW. Wh. Pedigree V.31. Hands and Part of Feet. 


“LOBSTER CLAW” FAMILY. 
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Fig. (xiii). 7. Wh. Pedigree IV. 21. Hands. Fie. (xiv). JZ. Wh. Pedigree IV. 21. Feet. 


Fig. (xv). R. S. Pedigree IV. 7. Hands. Fig. (xvi). R. S. Pedigree IV. 7. Feet. 


Fig, (xvii). H. Wh. Pedigree V.18. I’. W. Pedigree V. 17. Fig. (xviii). d. H#.F. Pedigree IV. 9. Feet. (See Plate XXV, 
Hands. Fig. (vi).) 
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(1) Introductory. 

The measurements on which the following paper is based were taken by 
Kathleen V. Ryley, in part at the Museum of the Royal College of Surgeons by 
the kindness of Professor Arthur Keith; in part at the British Museum (Natural 
History), where we have especially to thank Mr W. P. Pyeraft; in part at the 
Cambridge Anatomical Museum and in the private collection of Dr Duckworth, whom 
we have heartily to thank for his courteous and ready aid; in part at the Oxford 
Museum by the kindness of Professor Bourne. Lastly we owe the possibility of 
measuring a few specimens to Professors Thane and Hill of University College, 
London, who readily placed their material at our disposal. We have further to 
express our indebtedness to Professor Thane and to Dr Derry for aid in a variety 
of other ways. Only by the great help received from all these authorities would 
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it have been possible to bring together even the slender numbers now dealt with, 
and a very full sense of our obligations to these gentlemen must here be 
expressed. 

In a recent paper entitled “A Study of the Negro Skull” (Biometrika, Vol. Vu. 
pp. 315—320) a special study was made of the nasal bridge in the Congo and 
Gaboon crania and the results compared with measurements on Egyptian and 
English series. There resulted values so suggestive that a wider study was 
promised, and that promise is to some extent fulfilled in the present paper. 


In the former paper it will be remembered that the chord from dacryon to 
dacryon (DC) was measured—the mesodacryal chord—and the minimum are from 
dacryon to dacryon (DA )—the mesodacryal are. The ratio 100 (are — chord)/chord 
was termed the mesodacryal index 8. If on the other hand we measure the 
subtense of the minimum are (DS) and take the index 100 subtense/chord, 
-we obtain a second mesodacryal index, which we will term here the mesodacryal 
index a As @ cannot be found without a special instrument, or a special 
construction, an attempt was made to determine the second mesodacryal index a 
from the first 8 by the hypothesis that in man the bridge of the nose may be 
sufficiently closely represented by a catenary, which seemed a fairly justifiable 
assumption in this case. The index thus indirectly found will be termed in this 
paper the mesodacryal index a’, Tables to determine a’ from 8 calculated by 
one of us (Julia Bell) were published in the same number of Biometrika, 
pp. 338—9, as the paper on the negro skull. An extension of these Tables due 
to H. E. Soper, rendered necessary when we deal with apes, will be found in the 
footnote on p. 401 of the present paper. Meanwhile our attention was drawn by 
Professor G. D, Thane to Mérejkowsky’s pioneer work of 1882 on the nasal bones, 
He measured the shortest horizontal chord from nasal suture to nasal suture (SC) 
and the minimum subtense (SS) and took the index 100 SS/SC, which we have 
ventured to term the Simotic Index, S. There is little doubt that this index is 
of a more simple anatomic character, as being only concerned with the nasal bones, 
than the mesodacryal indices. But it is by no means so marked a physiognomic 
feature in the living as the mesodacryal index. It has further to be obtained by 
aid of a special instrument—in our case a modified Mérejkowsky instrument,— 
which may be termed the simometer. This simometer may also be used to test 
the relationship between observed a@ and calculated a. 


In the present investigation a, 8, S have been measured and a’ calculated for 
Gorilla, Chimpanzee, Gibbon and Orang. But in the case of the latter ape it 
was found possible to measure the simotic index in only a few cases,—the index 
taking frequently the form of the ratio of two almost vanishing quantities*. In 


* There were three cases in which the simotic chord was recorded among the Orang-utans. In two 
males it was 5°7 and 5:8 mm. In both these cases the subtense was zero, for the bones were practically 
flat; thus the simotic index was zero. In the third case, a female, the simotic chord was 7:0 mm. and 
the subtense 1:2 mm. and therefore the simotic index was 17:1. There were a few further cases in which 
the chord only was measurable. Thus on the basis of such isolated cases we should have to place the 
Orang-utan below the Chimpanzee, and both of course below the Negroes. 
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the Orang the nasal bones are fused into a narrow flat strip from 5 to 6 mm. wide, 
sometimes there is only a thread at the nasal bridge or the nasals are entirely 
absent. In some cases the maxillary bones stand out forming a ledge which over- 
hangs the region of the dacryon and extends beyond it, thus making it very difficult 
to measure with Mérejkowsky’s instrument. Initially we excluded entirely from 
our measurements the Gibbon, but notwithstanding the difficulties of including 
this ape, it afterwards appeared to us that it might be worth while to indicate 
the results in rough numbers. The difficulties partly arise from the fact that a 
large proportion of the Gibbon crania available are without any statement as to 
sex and we did not feel that an attempt at sexing would be really profitable, as 
the total numbers were not large enough to give double series. Further we are 
compelled to distinguish between Hylobates and Symphalangus, and even of the 
former with the sparse data as to locus usually provided, it is impossible to assert 
that we are dealing with anything like a homogeneous race. Thus we found - 
ourselves ultimately with only 25 cases— ¢+9+ 38 —Hylobates—indeed only 16 
for the simotic characters, and 9 cases— ¢ +? + 8 —Symphalangus—with merely 
six for the simotic characters. Our results in this case can therefore only be 
rough indications of the racial characters of the nose of these apes. Our Museums 
have yet to learn that for comparative study we want at least 50 to 100 crania 
of each sex drawn from definite districts. To compare with those of the apes 
the corresponding indices were measured in a number of human races selected in 
order to contrast the values, if possible, with those of the anthropoid apes of the 
same continent. A few other human races are taken to indicate general relation- 
ships. The Hindoo, Egyptian and English crania used are at University College 
either in the Biometric Laboratory or in the Department of Anatomy; the 
Asiatic skulls and the Congo skulls are at the Royal College of Surgeons; the 
Gaboon skulls at the British Museum of Natural History. The nasal features of 
the latter as well as of the English and Egyptian crania had already been 
discussed in Dr Crewdson-Benington’s paper referred to above. 


(2) Reliability of Measurements. 


The first point to be considered is the degree of reliability in the measure- 
ments made, and here we must again emphasise the difficulty of accuracy. Not 
only do the quantities measured differ by their extreme smallness from the 
customary cranial measurements, but the points for measurement and the 
instrument for measurement are by no means all that could be desired. When 
“a small quantity has to be measured, an error of reading which would have no 
importance in the case of either head breadth or head length, the cephalic index 
being in question, becomes of very great significance in the case of the simotic 
or even the mesodacryal indices. Further in the handling of Mérejkowsky’s 
instrument, there is much play for personal equation, and only after long practise 
will the recorder settle down into a process, which repeats within reasonable limits 
the values of the indices previously determined. In the next place for our present 
purposes only small series of crania were as a rule available, e.g. 20 to 40 of either 
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sex of each ape, 10 to 40 of the various Asiatic races. Considering the smallness 
of these numbers, it did not appear needful, in view of the labour involved, to 
measure more than 50 of each sex of the English (Whitechapel) and Egyptian 
(XXVI—XxXxX dynasties) crania of which far larger numbers were at hand. 


Further in such small series the question of sexing becomes of considerable 
influence. With a large series of crania of one type, we sex on our knowledge 
of that series itself. But in short series one is apt to sex by @ priort impressions 
of sex in other races, because we have not before us wide enough material to 
appreciate sex differences within the race itself. 


We propose to illustrate first the order of the differences which may arise 
from these sources, and we will deal first with sexing. Owing to the courtesy of 
Dr Derry we were able to test this point on 35 Nubian crania in his possession. 
They were independently sexed by Professor G. D, Thane and by Dr Derry 
himself*. The nasal measurements were taken and the mean values of the 
indices calculated. The following values were obtained, where we have added for 
comparative purposes those of other negro series : 


TABLE TI. 


Mm, My Me Me 


Nubians: Professor Thane ... | 48-0 | 40°9 | 49°7 | 43°6 | 49°6 | 40°0 | 34:5 | 23-4 
H Dr Derry ... -- | 48°9 | 39°7 | 51°0 | 42°1 | 51°6 | 37°7 | 33-7 | 23°5 
Gaboon : 1864 ... Sore poor |} <2) 44:0 | 43-6 | 44°9 | 40-0 | 42:0 | 30°9 | 27-7 
* 1880 ... sae ... | 42°9 | -44°1 | 44°8 | 45°] | 41:9 | 42:3: | 99:0) | 22:9 
Congo... ea eee oe | oh | 38722 | 40:5 |) 8826 35358) 32°60 eb GaieZowg 
——___—— 

| “Neeres”: Mérejkowsky | — 25°6 


It will be seen that for short series the personal equation of sexing makes 
1 to 2 points difference in the indices, but that in two series of the same race— 
Gaboon, 1864 and 1880,—differences of this order may occur at least in the simotic 
index. If we compare Mérejkowsky’s value for the simotic index with ours, we 
see that there would be excellent agreement if his somewhat vague “Negres” were 
from the Congo district. But we lay no stress on this agreement because we ' 
believe that Mérejkowsky tends almost invariably to get a higher simotic index 
than we do. This is illustrated in Table II. 

Of course there are but few races common to both series and in Mérejkowsky’s 
series as in ours the numbers dealt with are miserably small, but still cases where 

* Before discussion there were four crania in a total of 35 on which there was difference of opinion, 


say 11 per cent., and we have used the original determinations, putting as usual? ¢ and ?? with the 
$’sand 2’s respectively. 
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we can compare and the general appearance of the Table are both against the 
view that the differences are wholly racial. We believe them to be largely due 
to personal equation in taking a delicate measurement with a not very reliable 
instrument, and consider that without determination of personal equations by 
direct comparison of measurements on the same individual crania, it is not possible 
at present to compare simotic indices found by different craniologists. 


TABI: (i, 


Values of the Simotic Indew. 


Mérejkowsky Ryley 
Race qovaernee eet Race pos 
é 9 é | 8 
Negroes Bee Peet 2076 — Congo 25°6 25°7 
Malays aa soc. || opllcs) — Moluccas : ... | 26°5 — 
New Caledonians 38°5 30°6 | Gaboon, 1880 wes 29-0 22-9 
Mongols ce a 40°5 = Sumatra 29°2 -- 
Melanesians... ies 41°9 35°2 | Celebes 23 29°7 — 
Marquesas Island ... | 43°9 34:0 | Borneo RS neg || x02) 34-7 
Maoris ahs nea deo) 43°3 | Philippines ... .. | 3074 29°0 
American Indians ... 48°0 46°2. | Gaboon, 1864 ah 30:9 | 27-7 
New Hebrides Jeo | AOL 44°5 | Malay i soo |) BEI — 
Hindoos Nag Sua |) aaylal 42°5 | Java and Madura ... 33°9 32°7 
Auvergnats ... ee eos 45-0 | Nubians a ww | 34°] 23°4 
Gypsies ses son|| Sie! — Ainos ts .. | 43°74 --- 
Tahitians... oe 54°3 — Veddahs —... ... | 43°6 36°9 
French oe ... | 54°8 47-1 | Egyptians... w. | 444 36°9 
Dutch Re ae 58°6 55°2, | Hindoos a ise 44:7 39°1 
French Cymry Sen || BNA) 50°3. English ee en) O08 46°6 


In two points, however, we agree fairly well with Mérejkowsky, ie. in the 
general order of races and in the fact that the simotic index is a marked sexual 
character. In all cases—22 in number—except the Congo and Borneo crania, 
the female has a lower simotic index than the male and this is true for the apes, 
where we can determine it: 


; 3 ? 
Chimpanzee 21-4 18°5 
Gorilla 56°8 55'1 


The two exceptions are the Congo crania, where the two sexes are practically 
alike, and the crania from Borneo, where the female has the higher simotic index, 
but the probable errors being introduced, i.e. Borneo: ’s 30:2 + 1:2, 9’s 34°7 + 2-1, 
show us that even here the difference is quite possibly non-significant. Thus we 
conclude that in the female the nasal bones are flatter than in the male, and this 
secondary sexual character may be used—as far as it is ocularly appreciable—as a 
help in sexing. It may be remarked—in order to avoid circular reasoning—that 
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with the possible exception of Dr Derry’s classifications, the judgment in sexing 
has not been influenced by nasal characters*. 


While it appears to us that the difference of sexing will not be greater than 
that of two small samples of the same race and less probably than the difference of 
measurement of two different craniologists, we have still to answer the important 
question of how far the same craniologist will repeat results in the case of these 
delicate nasal measurements. We are able to make some interesting comparisons 
from this standpoint. K. V. Ryley made her first measurements of the nose on the 
Whitechapel, Egyptian, Congo and Gaboon series in the summer of 1911 for use in 
the paper on the negro skull (Biometrika, Vol. vii. p. 316). She was then new to 
these particular measurements, and before the second series were made she had 
nearly a year’s practice on a great variety of races. In the English Whitechapel 
series 13 crania measured in one series (1911) are not in the other series (1912), 
that is to say one quarter of the crania are not the same in the two series. These 
13 crania of the first series were originally Museum specimens, Le. skulls having 
some abnormality (other than nasal), or remarkable feature, and placed in the 
Anatomical Museum for exhibition. It was thought better to exclude these 
individuals in the second series and replace them by more normal examples. 
Thus the divergencies between the A (1911) and B (1912) English series are not 
solely due to personal equation. The Congo and Gaboon 1864 series are practically 
the same crania for A and B as nearly every skull which was measurable was 
measured on each occasion. 


The two Egyptian series are not the same crania, for they consisted of the 
100 crania of the series of 1800 skulls, which were being otherwise measured at 
the time when these nasal measurements were taken. Thus we may conclude 
that the Congo and Gaboon 1864 series represent changes due to the same 
measurer repeating operations at the beginning and end of a period during which 
experience was growing and methods being standardised; the Egyptian series 
represent this variation in the measurer together with the difference between 
random samples; and the English series represent not only the variation due to 
the measurer, but further the effect of modifying by 25 °/, the actual sample 


* Dy Derry does give weight to nasal features in sexing, but if we examine the results in Table I 
from this standpoint, we have 


Difference of male and female, ie. 3 - @. 


Ma ma! mg Mm 
Professor Thane (il 61 9°6 Taleit 
Dr Derry ee oe) 8:9 13'9 10:2 


It will be clear from these results that Dr Derry used the mesodacryal indices, i.e. the entire bridge 
of the nose, rather than the nasal bones in his sexual appreciation ; for he has a less sexual difference 
in the simotic index than Professor Thane, while he has greater values in the mesodacryal indices. 
Asa matter of fact the sexual differences in the Nubians seem to us of such an exaggerated character, 
that we venture to suggest that the females may have been captives and thus may be more definitely 
negresses, with whom their indices accord, than the males were negroes; the males being possibly a 
conquering race. 
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measured. Finally to test the extent to which K. V. Ryley’s methods were finally 
standardised, a third measurement of the male Whitechapel crania was made four 
months later than B, namely C in December, 1912. In the case of C the same 
50 crania exactly were remeasured*. There now resulted a very close accordance 
between all the measurements absolute and indicial. It will be noticed on 
examination of the Table that: 

(i) The error in the value of the simotic index due to variation of personal 
equation never exceeds a point. 


TABLE III. 


Comparison of Measurements on the Nasal Bridge made at intervals and on 
difere ‘ent t samples. 


Meso- ie : ft Meso- 
Seri dacryal Caan Simotic samece os | daeryal Simotie R les 
cue Chord se Chord (SC) u ay : ihe Index a’ | Index (S) Se 
(DC) NU (se) poo | (from £) 
Gaboon, 1864, Ag | 24°5+°2 | 33°54+°3 | 9°164°26 | 2°87+°11 | 42°7+ ‘9 | 454+ °6 | 30°8+ ‘9 |) Same 
- » BS | 23:°2+4°2 | 32-44°3 | 9°434+-26 |] 2°-88+-11 | 400+ °8 | 43°6+°5 | 309+ 915 cramia 
5) » AQ | 2143 | 32°34°4 | 9°55+°28 | 2-68+°10! 44:841°3 | 46°7+°8 | 28°741°0 |) Same 
5 » BD | 22°54°3 | 31-8+°4.] 9884-29 | 2684-1] | 42-041°3 | 44°94 °8 | 27°741°-0 | ( crania 
Congo, 4 ¢ | 24°44°3 | 32°8+-4 | 9'444'21 | 238408 | 34°54 9) 39'6+4'6 | 25°6+ “8 |) Same 
mr BiG ... | 23°94°3 | 82°44'4 | 9°804°21 | 2-484 09 | 85°54 °8) 40°54+°6 | 25°64 “7/5 cramia 
aye eA? | 23°54 °3 | 32-24°4 | 9-464 33 | 2°314°11 | 36°8+1°0 | 41-54 °7 | 25°2+1°3 || Same 
oy one .. | 23°34 °3 | 80°34°3 | 9°72+4°-35 | 2°414°12 | 82°64 °8 | 388°64°6 | 25°741-4 |( crania 
Egyptians, Ag ... | 21°54 °2 | 34:54-4 | 10°584+°18 | 4°674+°10 | 61-°341°5 | 57-049 | 44°8+ -8 |) Different 
x BS ...| Q15+°2 | 35-342 | 10°42+-17 | 4634-10 | 65-041°3| 59°44°8 | 44-44 -7 | ( samples 
_ AQ ...| 20°44 °2 | 82°44 :°2 | 10°764°17 | 4°114°09 | 59°34 °9 | 55°9+4°6 | 38°64 °7 It Different 
Ff BQ ...| 206+4°'3 | 32°0+°3 | 10°844:'16 | 3:°97+:09 | Ose ll || Gam sy | ates yar As) Ky samples 
| | | 25 
English, A ¢ meen 22cOct 2a Soda QroQered 7 | A 7ASeel ll h62ck es | 53°98) 5087 12 Ree 
oy BER meee 2loeee2 34-82-3938 7 | 4s68s= "10 62-7 A -2)| 5802-7 | b0"SsE 12 (ee 
| crania 
| 5° 
3s UA 5 || PAS )ae er || Rae iaoety I) fie tee cults} || agi aie Oy) | 53'44+1°2 | 52°2+:8 |) 474+ -9 cance 
He eB? | 20 OHee2 ola 33 | OO iy | AS1Gsee07 | 53°741°0 | 52°5+°6 | 466+ °8 om 
— — — — —— 
TABLE III bis. 
English Crania, 50 Males. 
Meso- Meso- | gq: x il ‘ : 
Meso- Jaapeal | simotic Simotic Meso- Meso- Meso- 6 : 
Series sooty 5 dacryal wae ae | Chord Subtense dacryal dacryal | dacryal 5 Ree 
(DC) are (DA) (DS) | (SC) (SS) Index B | Index a’ | Index a ae 
r 2 = : _ = 
B 21°54 °2 | 84°84 °3 | 1214 :1 | 938417 | 4-684 710 | 62°741-2 | 58-04-7 | 56-847 | 50°841-2 
C 21-44 °2) 34°64 °3 | 12°14'1 | 9-43 4°17 | 4°724°10 | 62°741°2 | 58-1 4°7 | 57°44°7 | 512412 


A are measurements taken at start of een B at eenciusion of een after 
practise and standardisation, B only used in this memoir. In the Table III b/s both series, B and 
C, were taken on the same crania at the end of the investigation with an interval of four months. 

* K. V. Ryley, it is needless to say, had not her earlier values before her when making her 
remeasurements, 
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That is, it is of the order of the probable error. It may amount to 1:7, which 
is only about twice the probable error when we combine the effect of random 
sampling with personal error. 

(ii) The differences in the mesodacryal index 8 can amount for personal 
variation to 42 (Congo $’s), which is as large a value as we obtain on the 
Egyptian series for combined personal error and random sampling, and even to 
66 points in the Whitechapel with change of sample and personal variation, 
which is five times the probable error. Corresponding differences will of course 
exist in a deduced from 8. We believe that this difference in the early and 
late values of 8 is due to greater standardisation of method in determining the 
dacryon with a rather blunt pointed instrument and to greater mastery of the 
difficulties arising from damaged crania and in the case of the apes from partially 
obliterated sutures. 

(i) In the absolute measurements mesodacyral chord (DC), mesodacryal 
subtense (DS), mesodacyral are (DA), simotic chord (SC), simotic subtense (SS) 
there is a maximum error in the means of about 4°5°/,, but this is rare, and 
reduces to insignificance with practise in the measurements*. 


(iv) A comparison of B and C shows that in all cases after continuous 
practise and standardisation of method, the variations in absolute measurements 
or in indices are less than the probable errors of random sampling. 


Of course the whole of the above remarks apply to a comparison of mean 
values and not to measurements on an individual cranium. We should conclude 
as follows: 


(a) The measurements on the nasal bridge need great care, and the same 
measurer may on repetition differ considerably from a previous determination on 
the same skull. 


(6) But with long practise and careful standardisation of method the racial 
values obtained will repeat themselves, if the series be measured afresh, and 
values obtained by the same recorder are quite intercomparable. 

(c) We are doubtful—especially having regard to Mérejkowsky’s results— 
whether racial values obtained by different craniologists, who are fresh to this 
side of cranial measurement, or who have not tested their personal equation on 
the same series, will be directly comparable. 

(d) Notwithstanding these difficulties, which are really only an intensifica- 
tion of difficulties which occur in all branches of craniometry, we believe these 
measurements on the nasal bridge to be really valuable and think that they 
ought to occupy more attention. They are of course only a phase of what may 
reach greater importance in the future: namely of what we may term mucro- 
metric as distinguished from the usual or macrometric measurements of the skull. 


* The earlier series of measurements tended rather to exaggerate the mesodacryal chord 
and are. 
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There are bones other than those of the nasal bridge which might give racially 
significant micrometric measurements, although specially designed apparatus, or 
an adaption of existing physical instruments as yet unused in craniometry, would 
have to be employed in their determination. 

(3) On the observed (a) and calculated (a) values of the subtense mesoducryal 
index. 


In the discussion on the negro skull (loc. cit. pp. 297, 316) it was pointed out 
that our original measurements of the mesodacryal chord and are leading to the 
index @ did not enable us to determine the subtense directly, or indirectly, until 
some assumption was made as to the shape of the bridge of the nose. We were 
not at the time these measurements were suggested to Dr Crewdson-Benington 
aware of Mérejkowsky’s instrument, which is as available for finding the mesodacryal 
subtense as for its designed purpose the determination of the simotic subtense. 
Accordingly after some consideration the catenary was selected as a fairly suitable 
curve for the nasal bridge and by means of it the value of the subtense meso- 
dacryal index (a’) was deduced from 8. In the present investigation a has been 
found directly and it is of much interest to test the legitimacy of the catenary 
hypothesis. 


TABLE IV. Differences a’ —a of Calculated (a’) and Observed (a) Values 
of Mesodacryal Subtense Index. 


3 2 Average 
Gorilla ... ae ae ite 0°21 0-07 O14 
Ainos ... sie oes eae 0°36 —- 0°36 
NiorTOOs {Gaboon (1864 + 1880) 1-01 0°88 0°94 
o ~~ (Congo e wae 1°45 1°37 1-41 
English TES —0°47 0°36 
Egyptians 1°53 1-12 1°32 
Chimpanzees ... 1-64 1:41 152 
Philippine Islands 1:10 2°16 1°63 
Moluccas 1°63 -= 1°63 
Hindoos 1°79 Beale) 2°45 
Celebes 2°41 == 2°41 
Borneo... 2°76 3°86 3°31 
Javanese 2°20 2°80 2°50 
Sumatra 3°73 -- 3°73 
Veddahs 4°58 4°99 4°78 
Orang-utans ... ene of 17°33 20°23 18-78 
ET Hylobates 18°60 18°60 
Pasa se ee ae | SAS HS ce eee 14-22) 14:22 
| 


We see, to judge by this Table, that the hypothesis of the catenary suits the 
Gorilla and the Aino nose very closely; it is fairly good for Negro, English and 
51 3 
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Egyptian, roughly we must subtract one point from the calculated to get the 
actual value of the index; for the Chimpanzee, the Hindoos and the inhabitants 
of the Philippine and Molucca Islands two points must be subtracted ; for those 
of the Celebes, Borneo, Java and Sumatra three points, for the Veddahs five 
points, and for the Orang-utan and Gibbon 14 to 20! Thus as we pass from 
Africa and the region of the Gorilla through Europe to Asia and to the East 
Indian Islands and the region of the Orang-utan we get roughly a continuous 
divergence from the catenary hypothesis, which fits the Gorilla, to a markedly 
non-catenary relation as in the case of the Orang*, While the Aino and Veddah 
have equal flatness of the nasal bones as measured by the simotic index, they 
stand at opposite ends of the scale with regard to a catenary hypothesis of the 
whole bridge of the nose. No attempt, however, is made to deduce any racial 
order from Table IV although it is not without suggestiveness. 


Another method of approaching the degree of relationship of a to a is to 
consider the correlation coefficient between them, 74. A glance at Table XX 
shows that it is very high. We have, averaging results : 


TABLE VY. Correlation vay. 


Chimpanzees and Orang-utans ... we T54 
Gibbons (Hylobates) are Se Bon) tells) 
East Asiatic Group om ee fer tele) 
Gorillas ae Br Bef ah ... 806 
Negro Group i ae ve a) O28 
Hindoos ae r or re .. ‘931 
Egyptians and English... ‘ies .. "964 
Veddahs ct on See oes OGD 


Finally in order to calculate the value of a from a’ fora given individual of one 
of the races dealt with, we append the regression lines, which provide the most 
probable value of @ for a given @’. 


TABLE VI. 
Male Female 

Orang-utans a= *6697d' +12°99 Orang-utans a= °5053a’ +28°98 
Chimpanzees a= °6629a'+ 8°66 Chimpanzees a= *6096a’+10°50 
Veddahs a= ‘8324a’+ 5°75 Gorillas a= ‘8404d’+ 7°11 
Congo a= ‘843la’4+ 4:91 Egyptians a= ‘8877a'+ 4:93 
Egyptians a= ‘9359a'+ 2°25 Gaboon, 1880 a= ‘8842a'+ 4:28 
Philippine a= *9448a'+ 1:25 Gaboon, 1864 a= ‘893la’+ 3°94 
Javanese a= ‘9359a'+ °82 English a=1-0037a'4+ °27 
Gaboon, 1864 a= ‘9782a’4+ °23 Congo a= ‘9739da'— 36 
English a= ‘9756a'+ 22 “Mixed” Malays a=1:0420a'’— 5°76 
Hindoos a= ‘9665a’+ ‘11 Veddahs a=1'1208d' — 12°71 
Borage aS '9645a' — 1:16 Gibbons, Hylobates a="7083a' 2-41 | 
taboon, 1880 a=1:0045a'-- 2°09 $4+¢9+2 

Gorillas a=1:0750a'-— 3°29 


* The reader will observe (see contours, pp. 428—9) that it is not proper to speak of a catenary form of 
the nose, The English and Gorilla noses are not at all catenary in contour, but the chord and subtense 
relation is on the average clearly that of the catenary chord and subtense. 
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It will be seen that the catenary assumption as to the bridge of the nose, 
notwithstanding the relatively high correlations of a and a’, is not very satisfactory *. 
If it were, the best fitting or regression line would be simply a=a’, Thus the 
regression coefficient should be unity and the constant term zero, This is roughly 
true for English females, less true for English and Hindoo males, and approximately 
the case in Congo females and Gaboon (1864) males. For the anthropoids the 
deviations are very considerable, and this is also true for the Veddahs. The 
catenary form of the nose appears to be an impression largely based on familiarity 
with the higher racial types. Thus the mesodacryal subtense index a can only 
be deduced from the mesodacryal arcual index @ for individuals of the higher 
races. At the same time Table IV shows that the racial value a of a as 
deduced from £ is very little in excess of the true value for a number of races, 
and we conclude that the hypothesis has greater value for interracial than for 
intraracial comparisons. 


(4) On the Simotic and Mesodacryal Indices. 


If we are going to use these indices as a racial scale, it becomes of some 
importance to determine which provides the better classification. Unfortunately 
the simotic index cannot be determined for the Orang-utan. But an examina- 
tion of Table IX shows that the Chimpanzee is placed at one end, the Gorilla at 


* Tt serves excellently, however, as a control of individual measurements, and when a’ is not 
approximately a it is always worth reconsidering the measurements involved. 

+ In order to obtain the values of a’ from those of @ for the eatenary hypothesis in the case of the 
Chimpanzee and Orang-utan, the table published in Biometrika, Vol. vit. pp. 338—9, had to be 
somewhat extended. The values in Table VII below were obtained. 


TABLE VIL. 


Supplementary Tables for Subtense Mesodacryal Index a as calculated 
from the arcual value 8 on the Catenary Hypothesis. 


(A) Values of a. 


| 
B 0 1 2) 3) 4 5 6 y | 98 9 

| 
6 | 153 | 15:4 | 15°6 | 15-7 | 15°8 | 16-0 | 16-1 | 16-2 | 16°3 | 16°5 | 
7 | 166 | 16-7 | 16-8 | 17-0 | 17-1 | 17-2 | 17-3 | 17-4 | 17-6 | 17-7 | 
8 | 17:8 | 17:9 | 18-0 | 18-1 | 18°3 | 18-4 | 18-5 | 18-6 | 18-7 | 18°8 | 
9 | 18:9 | 19:1 | 19-2 | 19:3 | 19-4 | 19°5 | 19:6 | 19°7 | 19°8 | 19-9 | 

10 | 20:0 | 20°1 | 20-2  20°3 | 20-4 | 20°6 | 20-7 | 20:8 20-9 | 21-0 
11 | 21-1 | 21-2 | 21°3 | 21-4 | 21-5 | 21-6 | 21-7 | 21°8 | 21-9 | 29-0 | 
12 22-1 | 22-2 | 22-3 | 22°4 | 22:5 | 22-6 | 22-7 | 22-8 22-9 | 23-0 | 

| 
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the other end of the simotic scale*, while in the case of the mesodacryal indices 
the Chimpanzee and Gorilla although widely separated from each other are both 
placed at the low end of the scale next the Negroes, while the Orang-utan 
appears next the Veddahs at the top of the scale. This appears an arrangement 
at least more in keeping with geographical relations, although we make no 
assertion that similarity in nasal form between Veddah and Orang and again 
between Negro and Gorilla or Chimpanzee is due either to closer descent relation- 
ship, or to a common environment compelling species dwelling under it to 
follow similar nasal developments}. As far as the mesodacryal indices are con- 
cerned, English, Egyptian and Hindoo are closer to the Veddah than they are 
to the Negro, and may in nasal form be considered closer to the Orang than to 


TABLE VII. (continued). 


(B) 
B a 8 a B | a B a B a B a 
| | 

101 | 80°2 | 126 94:0 | 151 | 107°5 | 176 | 120°8 | 201 | 133:9 | 226 | 147-0 
102 | 80°8 | 127 94°5 | 152 | 108°0 | 177 | 121°3 | 202 | 134-5 | 227 | 147°5 
103 | 81°3 | 128 95°1 | 153 | 108°5 4 178 | 121°8 | 203 | 135-0 | 228 | 148-0 
104 | 81°9 | 129 95°6 | 154 | 109:1 | 179 | 122°4 | 204 | 135°5 | 229 | 148°6 
105 | 82:4 | 130 | 96:2 } 155 | 109°6 | 180 | 122°9 | 205 | 136°0 | 230 | 149-1 
106 | 83:0 | 13 96°7 | 156 | 110°2 | 187 | 123°4 | 206 | 136°6 | 231 | 149-6 
107 | 83°5 | 122 97°2 | 157 | 110°7 | 182 | 124°0 | 207 | 137-1 | 232 | 150-1 
108 | 84°71 83 | 97:8 | 158 | 111°2 | 183 | 124°5 | 208 | 137°6 | 233 | 150°6 
109 | 84°6 | 134 | 98°3 | 159 | 111°8 |] 184 | 125°0 | 209 | 13871 | 234 | 151-2 
110 | 85°2 | 1385 | 98°9 | 160 | 112°3 9} 185 | -125°5 | 210 | 138:7 4) 2350) Woe, 
iii: | 85°8 | 186 99°4 | 167 | 112°8 | 186 | 126°1 | 2171 | 1389°2 | 236 | 152-2 
112 | 86°3 | 137 99°9 | 162 | 113°4 | 187 126°6 | 212 | 139°7 | 237 | 152-7 
118 | 86°9 | 138 | 100°5 | 162 | 113°9 | 788 | 12771 | 213 | 140°2 | 238 | 153-2 
| 114 | 87°4 39 | 101°0 | 164 | 114°4 | 189 | 127-7 | 214 | 140°8 | 239 | 153°8 
115 | 88:0 | 140 | 101°6 | 165 | 114°9 | 190 | 128°2 F 275 | 141°3 | 240 | 154°3 
116 | 88°5 | 141 | 102-1 ] 766 | 115°5 | 191 | 128°7 | 276 | 141°8 | 247 | 154°8 
iakif | 89°1 | 142 | 102°7 | 167 | 116°0 | 192 | 129°2 | 217 | 142°3 | 242 | 155-3 
118 | 89°6 | 142 | 103°2 | 168 | 116°5 | 193 | 129°8 | 218 | 142°8 | 243 | 155-8 
119 | 90:1 | 144 | 103-7 | 169 | 117-1 | 194 | 130°3 | 279 | 143-4 | 244 | 156-3 
120 | 90°7 145 | 104°3 | 170 | 117°6 | 195 | 1380°8 | 220 | 143-9 | 245 | 156°9 
121 | 91:2 | 146 | 104-8 | 172 | 118-1 | 196 | 131-3 | 227 | 144-4 | 246 | 157-4 
122 | 91°8 | 147 | 105°3 | 172 | 118°7 | 197 | 131°8 | 222 | 144°9 | 247 | 157-9 
123 | 92°3 | 148 | 105:9 | 173 | 119°2 | 198 | 132°4 | 223 | 145-4 | 248 | 158-4 
124 | 92°9 | 149 | 106-4 | 174 | 119°7 | 199 | 132° 224 | 146°0 | 249 | 159:0 | 
YEAS | 93-4 | 150 | 106:9 | 175 | 1202 | 200 | 133-4 | 225 | 146°5 | 250 | 159°5 


* There is slender evidence also to place the Orang-utan out of position below the Chimpanzee and 
Negroes in the matter of this index. 

+ Dr Hans Friedenthal (Beitridge zur Naturgeschichte des Menschen, Th. v. ‘‘Sonderform der 
menschlichen Leibsbildung,” Jena, 1910, 8. 61) considers the nasal form determined by environmental 
conditions: ‘‘Die anthropoiden Affen als Baumbewohner bedurften keiner Vervolkommung der 
Atemroéhre wihrend der Reduktion des Riechorgans, wie die bodenbewohnenden Menschen und 
Paviane. Die Gorilla, mehr dem Bodenleben angepasst als die tibrigen Anthropoiden, ja, mehr als 
viele Ostaffen und Westaffen, zeigt eine ausgebildetere dussere Nase als Schimpanse, Orang und 
Hylobates.” 
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A Study of the Nasal Bridge in the Anthropoid Apes 


the Chimpanzee, although all the races dealt with are nearer to the Gorilla than 


either of the two apes. 


and two mesodacryal indices : 


Simotic and Mesodacryal Indices. 


TABLE IX. 


Mean 


Values *. 


The following table gives the racial order for the simotic 


Simotie Index (5) Mesodacryal (Subtense) Index (a) Mesodacryal (Arcual) Index ({) 

3 g 3 [ re) ? 
[Orang-utans 0?) | [Orang-utans 172] — — — = 
Chimpanzees 21 | Chimpanzees 19 | Chimpanzees 29 | Chimpanzees 29| Chimpanzees 22| Chimpanzees 22 
Congo 26 | Nubianst 23 | Congo 39 | Congo 37 | Congo 35 | Congo 33 
Moluccas 27 -— Gorillas 41 | Nubianst 40 | Gorillas 36 = 
Gaboon, 1880 29} Gaboon, 1880 23] Philippines 42 Borneo 41 | Philippines 38 = 
Sumatra 29 | Congo 26 | Borneo 42.) Javanese 43 | Gaboon, 1864 40) Nubians + 39 
Celebes 30 — Moluceas 43 | Gaboon, 1880 44 | Moluccas 41 | Gaboon, 1864 42 
Borneo 30 -— Gaboon, 1864 43 | Gaboon, 1864 44 | Gaboon, 1880 42|Gaboon, 1880 42 
Philippines 30} Gaboon, 1864 28 | Gaboon, 1880 43 | Gorillas 45 | Borneo 42 | Borneo 42 
Gaboon, 1864 31) Philippines 29 | Malays 44] Philippines 45 | Malays 43 | Gorillas 43 
Malays 32 — Celebes 45 — Celebes 45 | Javanese 43 
Javanese 34 | Javanese 33 | Sumatra 45 — Javanese 46} Philippines 45 
Nubianst 34) Borneo 35 | Javanese 45 —- Sumatra 47 = 
Ainos 43 — Nubianst 48 —- Ainos 50 — 
Veddahs 44 | Veddahs 37 | Ainos 50 — Nubianst 51 — 
Egyptians 44 | Egyptians 37 | Hindoos 55 | Hindoos 49 | Hindoos 61 | Hindoos 54 
Hindoos 45 | Hindoos 39 | English 57 | English 53 | English 63 | English 54 
English 51] English 47 | Veddahs 57 | Egyptians 53 | Egyptians 65 | Egyptians 56 
Gorillas 57 | Gorillas 55 | Egyptians 58] Veddahs 59 | Veddahs 69 | Veddahs 73 

a = Orang-utans 74 | Orang-utans 79 | Orang-utans 123 | Orang-utans 137 


Now if we compare the results given in this table we see that with one 
exception, that of the Borneo crania, the simotic index is greater in the males than 
in the females; but there is a greater agreement between the sexes—the Hindoos, 
English and Egyptians excepted—in the mesodacryal indices. For the mesodacryal 
index, a, there are 7 cases (judging from Table VIII, p. 403) in which the female is 
greater, as against 7 in which the male is greater; for the mesodacryal index , 
there are also 7 cases for the male and 7 for the female with greater value, and 


* The following are the values for the Gibbons : 


| Simotic Mesodacryal Mesodacryal 
| Index (Subtense) Index (Arcual) Index 
| : Hylobates GOES 9 37 55 

| Gibbons 

| Symphalangus 6+9+2 10 29 43 

| 


+ The position of the ‘‘ Nubians” confirms the view that the men were 
women almost negroes. 


more Egyptian and the 
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the individual cases are the same for a and 8. On the whole it is the lower races 
for which the female has the greater value, and the higher races—in the scale 
of civilisation—for which the male has the greater value. Thus the mesodacryal 
index appears more of a racial and less of a sexual character than the simotic 
index. Roughly also the two sexes give nearly the same racial order, especially if 
we regard the large probable errors (see Table VIII, p. 403) of some of our results. 
Looked at in broad outline we have the nose of the Gorilla closely associated with 
that of the Negro, and the Negro lies between the Chimpanzee and the Asiatic 
races. At the bottom of the scale we have the Orang-utans closest of all to the 
Veddahs, who lie between the Orang-utans and the higher races, so far as such 
are provided by these measurements*. On the whole the racial scale of the 
mesodacryal indices seems far more suggestive than that of the simotic index, 
where the Gorilla is widely separated from the Negro, and is divided from the 
Veddahs by English, Hindoos and Egyptianst. We believe that a measure of 
the mesodacryal index is likely to be of as much value as the determination of 
the simotic index, possibly of greater value. 


(5) On the Absolute Nasal Measurements. 


We are doubtful as to what if any stress can be laid on the absolute 
measurements. Mérejkowsky, however, cites the absolute values of the simotic 
subtenses and it is of interest to compare our results with his. This is done in 
Table X. It will be seen that for Negroes, Malays and “ White races” our results 
are in fair agreement with Mérejkowsky’s; with his Melanesian, Polynesian and 
American groups, we have nothing to compare. There is relatively little difference 


in the order between the two sexes, but the actual height of the nasal bones, with 


; 
the one exception of the Borneo crania, is considerably less for the females. This 
smaller value of the simotic subtense in the female is not true for the chord, 
which in Nubians, Egyptians, Gaboon (1864), Gaboon (18580), and Philippine 
Islanders is greater in the female as will be seen in Table XI. There is very 
little we think to be learnt concerning racial differences in either Table X or 


Table XI. 


The races are mixed in a rather meaningless way—at least we have failed to 
find any clue to their order, although in this respect the simotice chord appears 
worse than the simotic subtense. On the whole we believe little of real value 
is to be learnt from the absolute size of the nasal bones. 


* The Gibbons agree with the Orang for the simotic index, but are nearer to the Chimpanzee than to 
the Orang for the mesodacryal index a, while for the arcual index f their position, like that of the 
Gorilla, is within the human range. 

+ Judged by the mesodacryal indices the Gorilla has a flatter nose than any human race, the Negroes 
approaching most closely ; judged by the simotic index the Gorilla has a sharper nose than any human 
race, the English approaching most closely. At any rate from the standpoint of physiognomy, there 
can be no doubt that the mesodacryal indices express better the general visual state of affairs: see, 
however, the contours, pp. 428—9. 
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TABLE X. 


Simotic Subtense *. 


* The Gibbons lie at the very top of this scale close to the Orang-utan with its practically zero 


value of the subtense. ‘Thus for the simotic subtense we have: 


Males Females | Sex?}+ Mérejkowsky 
Chimpanzees 1°78 Chimpanzees 1°47 — 
Gaboon, 1880 2°30 Gaboon, 1880 2°13 — 
Borneo 2°33 Congo 2°41 — 
Congo 2-45 | Philippines 2°58 Negroes 2°60 
Moluccas 2°48 — — 
Philippines 2°55 Borneo 2°59 Mongols 3°00 
Celebes 2°58 _— -- 
Malays 2°62 aa Malays 2°90 
Gaboon, 1864 2°88 Gaboon, 1864 2°68 — 
Sumatra 2°88 = Polynesians 3:13 
Javanese 299 | Javanese 2°80 _ 
Gorillas 3°31 | Nubians 2°81 — 
Nubians 374 | Gorillas 2°82 — 
Hindoos 3°78 Hindoos 3°02 — 
Ainos 3°80 — American Indians 3°80 
Veddahs 3°93 Veddahs 3:08 Melanesians 3°85 
Egyptians 4°63 | Egyptians 3:97 = 
English 4-68 English 4:16 White races 4:74 
TABLE XI. Simotic Chord t. 
Males | Females 
} 

Gorillas 647 | Gorillas 531 

Malays Ps || — 

Borneo 793 Borneo 7:66 

Gaboon, 1880 830 | Hindoos 7°86 

Chimpanzees 832 | Chimpanzees 8-07 

Philippines 842 | Veddahs 8:28 

Celebes 854 

Hindoos 871 | Javanese 8°58 

Ainos 8°80 a 

Javanese 8°83 Philippines 8°80 

Veddahs 914 | = 

Moluccas 9°36 English 9°10 

English 9:38 | — 

Gaboon, 1864 9:43 Gaboon, 1880 9°46 

Congo 9°80 Congo 9°72 

Sumatra 10:10 Gaboon, 1864 9°88 

Keyptians 10°42 Egyptians 10°84 

Nubians 11°45 Nubians 11°90 | 


large sexual differences. 


oS 
+ 


Gibbons, 


Gibbons, 


Hylobates 
ot PaO eee 


| Hylobates 5) 


S++ 


0-46 


0°62 


+ Apparently Mérejkowsky mixes both sexes together, which is hardly permissible considering the 


USymphalangus 6:00 
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On the Absolute Mesodacryal Measurements. 


(6) 


s the order for the mesodacryal subtense, chord 


give 


The following table 


and are 


E¢.8T snbunjnyduky fi ee ZLEL srnbunjynyduhy 39. subunynydwhe 
OLGT - seangohy iaeat eI cormgonky P+ S+P ‘wow | Se a pes P+ E+ P Swoqquy 
LE SAE | 
iG SUvIqUN SUPIQUN | FE SUvIquN | ZF SUVIQUAT | [9.0 ystpsu 2.6 sueiyds 
0-6€ suviydasiy suviyyddsq | ¢.¢% oBuon | 68% oanony Bat suenacae ae aaa 
B-1E (FOBT) Wooquy USSU | ¢.6Z (FIST) Uooqeyy | GETZ (OT) MOORE)  OL.OT SUvpPaA | ZLLT suvIquy 
8-1 Stepp A SULPPPA —8-1Z (OBST) Wooqey ZEsZ (FOBT) UOoquy ZE.6 (FOBT) Booger | EF-LT sqeppe A 
GE yst[oagy shunlg | 9-0& suviqddoy 8.3% SP)]0/) 09-6 (OST) Hoogeyy | OT-IT shuns 
= 9-6& (OS8T) Wooqery — | 0-83 BIYLUING —-GZ-G shurig | 99-01 SOOPUTFT 
a | 9.ZE VAPVUUNG YSILSUy | 9.7 OSOURAL — GF-OL soury 
0-1 (O8S8T) tooqey) | F-EE osuos SDINMODH | &.1Z sueydésq | FZ-6 SOOPULFT 16-6 (FORT) TOEGREy 
8-0 OSU0D | FE (FST) UOOguE) ooulogq | G.1zZ ysysugq = ; 98.6 ‘ aqeuing 
0-62 soopuly | F-1¢ OSOUVAL — | 12 SvoOonoy | ZZ-G suviqny | 92.6 (O88T) WoOgRe) 
eS = P-1 souly = | &.12 SaqeTaQ | 98.98 SPILWO) | €9.6 asouvAare 
9.9% ___ spyywoy | B18 soopuly]  F.6] QSOUPABL | Z.1Z ooulog — OF-G saqeter 
Ose —- sourddyiyg | 8.0¢ SDPO) = | 1-12 skyeqy 99.8 — sourddyrqg | 2¢.6 080, 
. -, 8-O0€ 89q9[9/) oi | 6.02 Soury Sea ES €E-6B Sve] 
6-1 oaudlog | Z.0€ SVOONOTL | F.GT sautddyiyg | ¢.0z sourddryry = CLG SROONIOTL 
8-16 shunig | T.0€ osutog | 6.81 soopuly] | 1.0 SYVppyA — €9.8 osu0g | 0.6 SD])1L0$) 
a 0-08 SARTRIL | CST syeppeA | ¢.6T Soopuly] | 8L-8 asouraee | 99.8 oatdog, 
9-1 asonvave | [.9Z soniddyiyg | 9.27 saazundunyg | 6.81 saazundunyg | 00.8 oaudlog GFR saurddryiyg 
FIZ = soazundunyy | 0.g%  — saazundunyg | 6-11 ShUDIC) | FCT shui | 60. — saazundwuayg FFG saazundunyy 
— | | 
: P | : £ : p 
ouy axoHy ASNALIAG 


‘squawmainsnayy pohsonposary Jo uapag ywroney 


TIX ATAVE 


408 <A Study of the Nasal Bridge in the Anthropoid Apes 


Now we have already seen that the sexual differences in the mesodacryal 
indices are not very large, but we notice at once that these sexual differences are 
very considerable in the absolute measurements. It becomes rather meaningless 
to assert that the Hindoo’s eyes are closer together than the Englishman’s, when 
the difference in mesodacryal chord is less than that between male and female 
Javanese. They are less for the same reason, namely, because the cranium as a 
whole is smaller, Practically the female is less 


and often very considerably less— 
in every mesodacryal absolute measurement, and we cannot usefully compare 
males and females—except as to absolute size, a sexual difference we are already 
familiar with—unless we get rid of this problem of size by using as we have 
already done the indices. But surely if this be true for sex, it is also true for race. 
Our scales become scales of absolute size and of very small racial significance. 
Tn all probability absolute lengths or breadths of the head would tell just as much 
as is to be learnt from these tables of relative racial sizes of the bridge of the nose. 
In our opinion there is little that has bearing on racial relationships. In, for 
example, the chord scales, it is possible that Veddah and Orang-utans are close 
together because of an ancestral lmk, but the Hindoo appears in this part of the 
scale because of his small size. The association of the Negroes with the Gorillas 
may again be partly ancestral, but the association of English and Javanese in the 
same part of the scale as the Gorillas is probably fortuitous. We are inclined to 
say that very little indeed can be deduced from absolute scales of this kind for 
nasal characters. 


It is often asserted that a characteristic feature of the negro lies in the fact 
that he has his eyes farther apart than other races of man. The average value 
of the mesodacryal chord for the Congo and Gaboon male negroes is 23°5 and for 
English and Kgyptians 21°5 ; for the females, Congo and Gaboon, it is 22°6, and 
for English and Egyptians 20°6. There is thus 2 mm. difference on the average. 
Is it possible for the human eye to appreciate this difference? We are inclined 
to doubt it and believe that it is the marked simotic platygephyrosis (see p. 430 
below) of the negro nose which produces largely the impression of greater ocular 


Value of Ratio 100 x Mesodacryal Chord/Minimum | 
Forehead Breadth. 


Race | 3 | i) 
Congo... SBS 24°6 | 25°1 | 
Gaboon, 1864... 24:0 24°3 
Gaboon, 1880 ... O35 23°5 
Egyptians ae 22°6 | 22°4 
Veddahs ... on 22°0 21:2 
English ... MA 21°9 22-1 
Hindoos ... atte Oils 21:0 
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breadth*. As the total size of the skull varies much from race to race 1t seems 
desirable to compare the mesodacryal chord with another horizontal measure and 
we have taken the minimum forehead breadth: see Table, p. 408. 


It will be seen that the Negroes stand at one end of the list and the Indo- 
Europeans at the other, but again the change of index is relatively small, and it 
may be legitimately doubted whether it would be visually significant. 


(7) On the relative Racial Variability of the Nasal Bridge Characters. 


Tables XIII and XIV give respectively the Standard Deviations and the 
Coefficients of Variation of the characters determined in such series as supplied 
sufficient data for even an attempt at these constants. There is no doubt in our 
minds that the coefficient of variation is the better measure for racial purposes in 
the case of absolute measurements. We will consider first what points arise from 
a discussion of the variability of the mesodacryal subtense, chord and are from this 
standpoint. The relative orders of variability are given in Table XY. 


Again beyond the broad fact that the apes on the whole are far more variable 
than the races of men, perhaps little can be learnt from this table. In the 
apes the female is more variable than the male in 5 out of 9 cases; the same 
ratio of 5 to 4 is maintained in the case of the negro races; in the lower races 
from Borneo and the Veddahs, the males are more variable than the females in 
the ratio of 5 cases to 1, while in the higher races—Kgyptian and English—the 
ratio is 5 cases of the female to 1 case of the male more variable. Altogether 
therefore the female is more variable in 16 and the male in 14 cases. _ Generally 
there is no evidence for greater male variation in these coefficients of variation 
of the mesodacryal absolute lengths. Even if we based our comparison on the 
absolute variations of the mesodacryal characters, we find among the apes the 
male is more variable in 5 the female in 3, with one case—the Chimpanzee 
mesodacryal subtense—of equality; in the lower races (Veddahs and Borneo) 
the male is more variable in all 6 cases; in the negro races in 5 out of 9, but 
in the higher races (English and Egyptian) in only 1 out of 6 cases, Thus in 
absolute measurement the male is more variable in 17, the female in 12 cases. 


* The reader must not judge from Figs. 6—9 of the contours, pp. 428—9. These all have the meso- 
dacryal chord in considerable excess of the negro mean, and were selected to give approximately mean 
angles, not mean absolute lengths. 
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For the simotic chord and subtense we have: 
TABLE XVI. 
Order of Racial Variability for Simotic Characters*. 


Simotic Subtense Simotic Chord | 
3 | 9 3 9 | 
pe. ai ais = | 
Hindoos 21:9 — Egyptians 16°38 | Chimpanzees 10°10 
Gorillas 22°0 | English 18°8 | Veddahs 18-4 | Heyptians 153 
English 22°1  Hgyptians 22°8 | Hindoos 186 — | 
Veddahs 230 | Veddahs 33-4 | English 18:9 | Gaboon (1880) 17:9 | 
Egyptians 23°1 | Gaboon (1864) 34:5 | Chimpanzees 21°9 | English IES HAS) | 
Gaboon (1880) 29:3 | Borneo 36°6 | Philippines 22°7 — | 
Philippines 313 — | Borneo 22°8 | Veddahs 20°7 
Borneo 32°9 | Gorillas 37:0 | Congo 22‘9 | Borneo 25°4 
Malays 36:3 — | Javanese 23°1 — | 
Congo 36:9 | Congo 37°3 | Malays 24'8 — | 
Gaboon (1864) 37:4 | Gaboon (1880) 39:2 | Gaboon (1864) 28:2 | Gaboon (1864) 26:0 | 
Javanese 38 4 — Gaboon (1880) 28°7 | Congo 26°6 | 
Chimpanzees 411 | Chimpanzees  39°2 | Gorillas 31:1 | Gorillas 28°4 | 


Again there is practically little to be learnt, the Gorillas are very variable in 
nasal chord and much less variable (in scale position) on the subtense, while the 
reverse holds for the Chimpanzees, the subtense being far more variable than the 
chord. The Negroes are as a rule low down on the scale, being considerably more 
variable than the English, Egyptians or Hindoos. In 9 cases the female and in 9 
the male is more variable in simotic characters. 


Lastly we pass to the indices, omitting the calculated mesodacryal subtense 
index a. We find for Standard Deviations the order given in Table XVII. 


Again there seems little to be learnt with regard to racial variability from 
these results, beyond the confirmation of what we have learnt from the earlier 
tables that the Orangs, and the Gorillas as a rule, have large variability ; here, 
however,—as distinguished from the coefficients of variation of the absolute 
lengths—the Chimpanzee is seen to have very small variability. If, however, 
we take the coefficients of variation of the indices we find the Chimpanzee is 
associated with the other two anthropoids of our scales in high variability. 
Turning to the sexual differences, we find in the apes as many cases (4) of female 
as of male greater variability; among the human races there are 10 cases in which 
the female is more variable and 11 cases in which the male is more variable, or 
again there is little if any distinction of sex in variability. 


An examination of Tables XIII and XIV will show the reader that much of 
the non-significance of the variation scale for racial purposes is quite possibly due 


* The values for Hylobates, both sexes together, are: simotic chord 14°5 and simotic are 5772. 
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to the smallness of our samples. Except in the case of Egyptians and English, the 
size of these samples was not under our control; we set out to compare the African 
and Asiatic anthropoids with the human races of the same environment, and we 
measured all the crania available, and calculated the variabilities for all series of 
10 and over*. Further the homogeneity of some of the series, as the Malay and 
Javanese, is possibly open to doubt. We have accordingly clubbed together our 
variation results for the coefficients of variation of the absolute measurements, 
with the results given in Table XVIII. 


TABLE XVIII. 


Average Values of Variability of Absolute Measurements on the 
Nasal Bridge. 


With Simotice Lengths Without Simotic Lengths 
| ona 2 3 oa 
| Se ee ee ie |e 
Chimpanzees ... 23°4 224 Chimpanzees ... 18-0 |; 20:8 
Gorillas ioe 20°5 | -23°6 Gorillas fas 16°5 | 17°5 
Orangs coo | == = Orangs wi 163. |) 12°38 
[Hylobates 6+ 9 23°4] [Hylobates +9 15°1] 
Mean 22°5 + Mean 16°9+ 
Malays ree |) LOks — Malays See 256 = 
Javanese oer IS 9/ — Javanese 12°3 — 
Mean 19°8 Mean 12:5 
Congo ... ne 20°0 18°9 Congo ... aa 13°5 10°1 
Gaboon (1864) 19°3 19°8 Gaboon (1864) | 10°3 12°9 
Gaboon (1880) 17°6 18°5 Gaboon (1880) 10°1 11°9 
Mean 19°0 Mean 11°5 
Philippines... 17°5 — Philippines 11°2 — 
Borneo... ie 17°4 18°2 Borneo... ia 10°5 SPT 
Mean 17*7 Mean 10°5 
Veddahs soo pay) 18:0 Veddahs 12°7 11°9 
Hindoos Be, 15°2 — Hindoos si 11°7 = 
Mean 16°4 Mean 12:1 
Egyptians 13°2 | 146 Egyptians 8°7 11°5 
English ae 13°7 | 13:9 English ies 9°3 10°3 
Mean 13°8 Mean 10:0 
con | 
General Means 17-9 | 18-7 General Means 12°5 12°9 
! ——= —— ——t 


* Nubians excepted : these were included at a later date. 
+ Hylobates not included. 
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Whether we take the number of cases in which the female is more variable 
than the male—i.e. 7 out of 9 in the first series and 6 out of 10 in the second— 
or the mean variabilities of the whole series, we see that the female is certainly 
not less variable than the male. 

Massed as above the table seems to show more racial relationship to variability. 
The races group better together, and generally speaking—Veddahs, perhaps, ex- 
cepted—there is decreasing variability as we reach higher degrees of civilisation, 
ie. as we pass from the Apes through East Asiatics and Negroes to Hindoo, 
Ancient Egyptian and English. It is as well to see if these results correspond 
with those for the two mesodacryal and the simotic indices. 

It will be seen in the first place that these results (Table XIX) confirm those 
of the previous table (XVIII) for the variation of the absolute lengths, in that 


TABLE XIX. 
Average Values of Variability of Indices of Nasal Bridge (a, B and 8) 


From Standard Deviations | From Coefficients of Variation 
3 9 | cules 

ia ' is | S L | se 

Chimpanzees ... 68 | 78 Chimpanzees ... | 29°2 | 35:3 

Gorillas 6 14°55 | 14:4 Gorillas oe 315 | 29°8 

Orangs noe 30°8 | 24:0 Orangs 30°70 | 211 

[ Hylobates + 2 9'9] [Hylobates d+ 2 30:4] 

Mean 16°4* Mean 29°5 * 

Malays a 8°2 | = Malays 20°8 = 

Javanese at A 2 | Javanese 27°5 = 
Mean 9°7 Mean 24°2 

Congo ... te Td Ny dae Congo ... ae 22°9 24°3 

Gaboon (1864) 74 | 9-1 Jaboon (1864) 20°4 24°8 

Gaboon (1880) 93 | 94 Gaboon (1880) 25°0 28°3 
Mean 8°3 Mean 24:3 

Philippines... 7:0 — Philip pimesoees | eel OE? mele 

Borneo... ae 10-0 8°4 Borneo... ise 98°5 | 21:9 
Mean 8°5 Mean 23°2 

Veddahs ace 10°3 14:9 Veddahs Ro 18°4 27°7 

Hindoos <a 9°7 — Hindoos ad 18°5 — 
Mean 11°6 Mean 21°5 

Egyptians 9°3 8°9 Egyptians... 16°5 18°6 

English chs 10°6 8°3 English aes 18°9 16°3 
Mean 9°3 Mean 17°6 

General Means 10°9 11:2 |) General Means 23-4 24°8 


* Hylobates not included. 
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they make the female slightly more variable on the average than the male. For 
the standard deviations the male is more variable in 6 out of 10 cases; for the 
coefficients of variation the female is more variable in 6 out of 10 cases. In either 
method of measurement the apes are more variable than men, with the one 
exception of the Chimpanzees in the case of the standard deviations, where the 
small variation is associated with small variation in the Negro group. 


(8) Racial Relationship as based on Nasal Bridge Measurements. 


Having general regard to Tables XV—XIX for variability, and comparing 
the orders therein with those for the racial order of absolute lengths, it seems 
impossible to suggest any linear scale of arrangement which will mark racial 
relationship ; it is impossible to assert that the Negroes or the East Asiatic group 
stand regularly higher or lower in a lineur scale. Still there does appear to be 
some order in the scales both for absolute mesodacryal values and for variabilities. 
Thus the Veddah nasal bridge is more closely related to that of the Orang than 
the latter’s to the Negro group, which on the whole is closer to that of the 
African apes. Our numbers are too small, our probable errors too large, for 
insistance en any individual point, but the measurements lead to constants both 
for index and variability suggesting a scheme of the following type : 


; English 
pak Hindoo 57@ 
of/an §5' 
a ie 
& 
yeddah 
57 
Celebes45 Borneo 
es 42(/!) 
2G, mes Aino 
, Oe Fino) 50 
Gorilla ~~ \? Congo 
MY) eS “Orang 


74 ((4) 


P Gibbon Hylobates 
37 


Chimpanzee " Gibbon Symphalangus 
29 (78) 29(?) 


Tree. Mesodacryal Subtense Index and Average Variability for ¢’s indicated. 


418 <A Study of the Nasal Bridge in the Anthropoid Apes 


On this tree we have placed the mean values of the mesodacryal subtense 
index (a) and the average variability of the mesodacryal lengths*, both for 
males to indicate the degree of correspondence suggested. 


In such a tree as this we see the general tendency of the higher races to have 
(1) a more developed nasal bridge, and (2) a markedly lessened variability. The 
tree is merely diagrammatic and no great stress is of course laid on it, but it does 
suggest that the results reached both for mean values and variabilities may not be 
merely contradictory, but correspond to a real evolutionary order in the develop- 
ment of the nasal bridge, 


(9) On the Correlations of the Nasal Bridge Characters. 


The correlations of the two mesodacryal indices and the simotic index are 
given in Table XX. 


TABLE XX. 


Coefficients of Correlation. Nasal Bridge Indices. 


"ap | eed T So. 
j OT Ses 9! Vas eae 
Congo “an ...|50 *881+°021 | 27 *864+°033 | 50 °882+°:021 | 27 -867+°032)50 613+ :060| 25 -704+ 068 
Gaboon, 1864... w|48 9044 °018 37 *941-4°018 | 48 *905+°018 | 57 945+ °012 | 47 +258+°092| 35 +509 + 084 
Gaboon, 1880... ... | 16 ‘976+ 008 | 18 °959+°013/| 16 -978+-°007 | 18 °959+°013 | 16 *423+°138/ 18 -508+°118 
Philippine Islands salle ioe O78 = Hi, OPT SS ONT || == 17 °374+:141 
Borneo ... ar ... | 26 948+ °013 = 26 -945+4°014 = 26 +426+4°108 
Java and Madura ... | 39 °940+'013 = 39 “944+ °012 | = | 39 °318+°097 = 
Mixed “ Malays” Se = 24 *897 + °027 = 24 °897 + :027 | — 23 :696 + 073 
Hindoos ... aa | LO *929-+ 7029 — 10 *931 + °028 —_— 10 *891 + 044 = 
English ... ... | 50 *955+°008 | 50 +9614 :007 | 50 *956+°008 | 50 °962+:°007 50 -440+:°:077 | 50 +519+:070 
Egyptians ms shee | OU) ‘976 + °005 | 50 "964+ °007 | 50 °975+4:°005 | 50 -964+4°007 50 *444+°077 | 50 -471+4:074 
Veddahs ... See ...| 15 °959+°014/ 12 -971+°011 | 15 -958+°014| 12 -972+°011 | 15 -°291+4°159| 12 :369+°168 
Chimpanzees 33 “O75 +064 | 1S *782+4 °062 |) 363 -674+°064 18 *784+°061 (13 :062+°186 — 
Gorillas ... sia wee | 27 889+ °027| 25 9928+ °019 | 27 *888+°027 25 :924+ -020 | 21 "740+ 067/19 °464+°:121 
Orang-utans ves ...| 44 °839+ °030| 13 °720+°090| 44 °844+4°029 13 -715+:°091 | — = 
Gibbons, Hylobates 8 + 9| 25 812 + 046 25 "815 + 045 16 "199 4-162 
| | 


<2 Le Nat RS 2 pa 


ii 


We haye already (see pp. 399—401) discussed the relation of a to a and seen 
that it does not lead us to any very marked racial order. The correlation of the two 
mesodacryal indices gives a somewhat clearer result, if we group the correlations 
thus: 


* Coefficients of variation of arc, chord and subtense. 
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Yap in Groups. 


) s 
Apes <3: ves “801 “810 
East Asiatics ... 871 897 
| Negroes ... ache “920 | 931 
Veddahs... fos 959 ‘971 

| Hindoos ... Ree "929 | —_ 
Egyptians ) : <6y55) 
English if eee 965 962 
All values, means “892 899 


The suggestion is that there is increased correlation with higher civilisation, 
the Veddahs being somewhat out of rule. The females are slightly, but very 
probably not significantly, more highly correlated than the males. There is 
nothing out of keeping in this correlation order with that for the means and 
variations exhibited in the “tree” on p. 417, but it is too slender to give it much 
support. If we turn to the correlations of simotic and mesodacryal (subtense) 
indices rs, we find a greater range of variation and thus more possibility of 
getting an order of some racial interest. 


‘gar Correlation Order. 


| | 
| J | 
Chimpanzees... ‘06 | — 
[ Hylobates nee 20 (6 + ¢) —-] 
Gaboon, 1864... "26 ‘51 
Veddahs ... we 29 37 
Javanese... ae 32 — | 
Philippines ths 37 _- | 
Gaboon, 1880... “42 51 
Borneo... cs “43 — 
Malays ... tye — ‘70 
English ... ie “44 "52 
Egyptians - ise “44 “47 
Congo... ae ‘61 ‘70 
Gorillas ... aoe “74 “46 
Means “40 53 | 


With the single exception of the Gorilla the females have in every case, and 
often markedly, more correlation than the males. But the results are extremely 
irregular, even if we allow for the small numbers dealt with. Accordingly it 
seemed worth while to deal more at length with the relation between the simotic 
and mesodacryal characters, and further correlations were worked out. It must 
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be borne in mind that the mesodacryal subtense consists of two portions, the first 
part due to the nasal bones is the simotic subtense, the second part is due to the 
maxillary bones. If we subtract the simotic subtense from the mesodacryal sub- 
tense (DS) — (SS), we have a measure of the part of the nose due solely to the 
maxillary bones, and we may correlate this with (SS) the part due to the nasal 
bones. There will be no spurious correlation in this as in correlating (DS) and 
(SS) directly. Further we may form an index {(DS)—(SS)}/(DC) and correlate 
this with the simotic index (SS)/(SC), although in doing this (SC) must be 
considered in a certain sense as contained in (DC). The spurious correlation, 
however, is unlikely to be as great in this as in (SS) and (DS), for the maxillary 
walls of the nose can approach so close that there is no (SC) at all, as in the case 
of many Orangs and in some Negro skulls*. 


DS-SS 


Fra. (i). 


(10) Lhe Maaillary and Simotic Nasal Angles, 6 and q’. 


The value of DO —SC may be used to obtain a measure of the flatness of the 
maxillary walls of the nose. Clearly (see figure above) 


tan 6 =(DS — S8)/4(DC — SC). 


We term ¢ the maxillary nasal angle and its complement 90°—q measures 
approximately the angle the maxillary wall of the nose makes with the median 
plane of the skull. Table XXI records the values of the nasal maxillary angle ¢ 
for the races dealt with; they are obtained not from the calculation of @ from 
individual crania, but by the rougher process of using the mean values of the 
mesodacryal and simotic subteuses and chord in the above formulat. The table 
emphasises again the relation of the Veddahs and the higher races to Orang and 
Hylobates, while it indicates the relation of the Negro races to the African anthro- 
poids. At some points the female order is not in accord with the male, but the 
smallness of many of the series dealt with seems sufficient to explain the source 
of these differences. Some races have clearly a marked sexual difference, but it is 
not always one way and the means show no very great difference; more data and 
longer series would be needful to be certain of a persistent sexual differentiation. 


* B.g. Gaboon 1864, Nos. 10, 11, 81, and Congo 70: see Biometrika, Vol. vit. p. 319, and see 
especially the remarks in Tables of measurements. In one gibbon the simotic subtense was actually 
negative or the nasal bones depressed. 

+ Table XXIV a gives the values of ¢ and their variabilities for a small number of selected races. It 
will be seen that the differences due to the two methods are not great. The short method will give 
results close enough for any conclusions which are in the least likely to be drawn from such small series. 
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TABLE XXI. 
Maxillary Nasal Angle ¢. 


| 
re) 4 
i eee - = = z} 
Orang-utans Dbz" | Orang-utans 575° 
Egyptians 54°3° | Veddahs 5671" 
Veddahs 53°8° Keyptians 54°3° | 
| Hylobates (f+) 52°5°] |. [Hylobates (J+) 52°5°) | 
Hindoos 2.0m Gaboon, 1880 50°3° | 
Nubians ilesy English 49°3° 
Sumatra 513° Philippines 490° 
English 50°8° = _-- 
Ainos 47°6° — — 
Moluccas 47°0° _ — 
Celebes 16°9° Gaboon, 1864 48°5° 
Javanese 460° = = 
Gaboon, 1864 ANS SF Hindoos 48°4° | 
Gaboon, 1880 45°2° Nubians 48°2° 
Malays 44°7° —- a 
Philippines 44°6° | Javanese 44°8° 
Borneo 44-4° Congo 42°5° 
Congo 44°1° | [Symphalangus 42°3°| 
[Symphalangus (¢ +9) 42°37] | Borneo 42°0° 
Gorillas SDole | Gorillas 39°'1° 
Chimpanzees 34°8° | Chimpanzees 37°8° 
Mean 47°1° Mean 47°7° 


The smaller d, the flatter the maxillary part of nasal bridge. 


If we now turn to Table XXII we find a number of additional correlations, 
which are sufficiently interesting to make one appreciate how much more valuable 
they would have been had we had longer series and more variety of races avail- 
able. We have first correlated the mesodacryal and simotic portions, DS— SS 
and SS, of the whole subtense of the nasal bridge. In every case dealt with, male 
and female, these subteuses are negatively correlated. On the other hand the 
correlation of the mesodacryal subtense with the simotic subtense 1s always posi- 
tive and often substantial*. This positive correlation is therefore due to the 
appearance of SS in both variates, i.e. it is a spurious correlation. Actually when 
the maxillary part of the nasal bridge is larger, then the simotic part is smaller. 
Intraracially—that is among individuals of the same race—-there is a compen- 
satory growth of the nasal bridge and those who develop large nasal bones have a 
smaller maxillary nasal development. This point seems to us of very great im- 
portance; the tendency now-a-days is to insist on the importance of anatomical 
unities. This has largely arisen from two sources. In the first place craniology 


* For the mesodacryal subtense with the simotic subtense, all the correlations except that for the 
male Chimpanzee are positive and often substantial, the intraracial correlation amounting to + ‘34 to 
+°38. Further the interracial correlation is high indeed, being +°81 to +:87. But as we see these 
results are in part spurious, 
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has been chiefly in the hands of the pure anatomist rather than in those of the 
student of philosophical evolution. The former has insisted on the importance of 
growth centres and anatomical wnity of measurement; he has usually no appre- 
ciation of the relatively high correlations of the parts of the bony structure. He 
is thus liable to overlook the fact that from the standpoint of evolution a complex 
anatomical organ may be of far more importance to the race and to the individual 
than one factor of it which may be an anatomical unity. In the second place 
Mendelism has become the mode, and to surmount difficulties about characters 
which do not “ mendelise,” it has been customary to assert that they consist of a 
complex of simple Mendelian units*. On the original Mendelian theory such 
“units” were asserted to be independent, although as such a theory was found to 
be unworkable, a crude theory of “coupling,” as a measurable association of 
anatomically or physiologically simple unities, was evolved to cover the real facts of 
correlation. In this manner there has arisen a conscious or unconscious association 
of anatomical and Mendelian units, and it has been supposed that an anatomical 
unity would be more likely to “mendelise” than a combination of such unities— 
a single bone like the femur than a measurement like the leg length or the stature. 
The present result seems an illustration of the futility of neglecting the correlation 
of characters or treating them as compounded of independent Mendelian or even 
anatomical unities. The bridge of the nose consists of parts which are distinct 
anatomical unities, but the evolutionary factor is probably far better represented 
by the physiognomic factor—the whole bridge of the nose—than by any anatomical 
unit. When one part exceeds there is a correlated defect in the other part, and 
this rule extends practically through all the races examined. It is thus quite 
conceivable that a single measurement like the sagittal arc is from the evolu- 
tionary standpoint far more important than any of the simple ares—opisthion to 
lambda, lambda to bregma, and bregma to nasion—into which we may reduce it by 
aiming at the measurement of anatomical unities. There is such a thing as an 
evolutionary factor, a determinant which controls the development of a whole 
series of anatomical unities, and the heredity brought into evidence by such 
a factor may be far more important from the standpoint of evolution than the 
heredity of any simple anatomical character. What group of anatomical units 
go to form any such determinant can only be appreciated by a thorough study of 
the correlations of simple anatomical parts. 


Especially in the nasal bridge is this matter of very great importance, for 
while within the race an individual with small maxillary height (DS — SS) has a 
large simotic height (SS), ¢nterracially this is no longer true, the race with a 
small maxillary height will have a small simotic height and the positive correla- 
tion interracially is very substantial (+°581)—this must be compared with the 
mean intraracial correlation which is —°333. Both these results are for males, for 
females the numbers are +°501 and —:226 respectively. This reversal of sign 


* Of course when it is convenient a vast congeries of factors, such as ‘‘ Albinism ” or ‘“ Jewishness,”’ 
are illogically treated as a simple Mendelian “unit” ! 
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when we pass from intraracial to interracial correlations is a thing previously not 
unknown. Thus in 1903 in a paper* on interracial and intraracial correlations, 
Jacob, Lee and Pearson showed that while with individuals a long cranium is likely 
to be a broad cranium, yet interracially a compensating factor comes in, a long- 
headed race is likely to be a narrow-headed race. Thus it is quite conceivable that 
capacity of the cranium—depending on many anatomical units—may be of far 
more evolutionary importance than the measure of any single “anatomical unit” 
of the skull. We have referred at length to these matters here, because there is 
a growing idea—fostered probably by the idea of Mendelian “units,” that the 
measurement of “anatomical units”—or the measurement between “anatomical 
points” —is of primary importance in craniology. Whether craniologists measure 
the same thing or not is immensely important, the correlation between anatomical 
units 1s also of great value as determining what combinations of simple units form 
evolutionary factors. But very little service is done by insisting too largely on 
anatomical unity in and for itself. Evolution depends largely on physiological 
fitness, and organs of physiological importance are rarely compounded of either 
single anatomical or of single Mendelian units. 


If we now turn to the index correlations of (DS —SS)/DO—ie. height of 
maxillary portion of nose by its breadth—with SS/SC— ce. height of nasal bones 
portion by its breadth—our results are less uniform, Putting the male Gorillas on 
one side for a moment there are only two of the correlations which can be said to 
be significant, namely, possibly the male Chimpanzee and the female English, and 
these are of opposite signs. The means for male races and for female races are 
— 065 and +011, and these would be of no service for prediction. With regard 
to the Gorilla result, this arises principally from three males with the very high 
simotic indices of 88, 106 and 103. It is conceivable that the material is not really 
homogeneous; the arithmetic has been carefully repeated without change in 
values. If we take these indices to measure the cuneal or wedge-shaped properties 
of the two portions of the nose, we should say that individually there is very little 
relation between the ratio of height to breadth in the simotic and maxiilary 
portions of the nasal bridge; the only exception to this rule being the male 
Gorilla. On the other hand the cuneal characters are positively and sensibly cor- 
related (+°370 to +°393) interracially, an obtuso-cuneal nasal part being also 
associated with an obtuso-cuneal maxillary part and an acuto-cuneal nasal. part 
with an acuto-cuneal maxillary part. 


We have already discussed the maxillary nasal angle ¢, the complement of 
which roughly measures the angle between the maxillary wall of the nose at the 
bridge and the sagittal plane of the skull. We can introduce a similar angle ¢’ 
to measure roughly the angle between the nasal bone and the simotic chord. We 
may take (see Fig. (1), p. 420): 

tan ¢' = SS/($ SC) 
=2 Simotic Index. 


* Biometrika, Vol. 1. p. 355. For the first introduction of the ideas of intraracial and interracial 
correlation, see Biometrika, Vol. 1. p. 460. 
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It will be of interest to investigate whether the correlations of ¢ and ¢’ are 


higher than those of the cuneal indices. 


The smaller q' the flutter the simotic portion of the nasal bridge, ve. the 


TABLE XXIVa 


TABLE XXIII. 


Simotic Nasal Angle ¢’. 


| Z 9 
Wiss ieee 
| Gorillas 48°6° Gorillas ia 
English 45°5° English 430° 
Hindoos 41°8° Hindoos 38°0° 
Egyptians 41°6° Egyptians 36°5 
Veddahs Allele Veddahs 36°4° 
Ainos 40°9° | = = 
Nubians 34 3e | Borneo 34°8 
} Javanese Sule | Javanese apie 
| Malays 340° | = 
Gaboon, 1864 31°7° Philippines 301 
Philippines ay t aah Gaboon, 1864 29°0° 
Borneo oulilie Congo 27°2° 
Celebes 30°7° — — 
Sumatra 30°3° = = 
Gaboon, 1880 30°1° Nubians 95°2° 
Moluccas 28°0° = = 
Congo Difiailis Gaboon, 1880 24°6° 
Chimpanzees — 232° Chimpanzees — 20°3° 
tSymphalangus 11°5° tSymphalangus 11-5" 
+ Hylobates 98° + Hylobates Sh 
Mean OoLon Mean 29°8° 


Egyptians 
[ Hylobates (4 + @) 
English... 
Javanese ... 
Gaboon, 1864 
Congo 
Gorillas 
| Chimpanzees 


MALE 
Mean Standard Coefficient of Mean 
Deviation Variation 

suilkepier yi/ ee 97+ -40 | 1099+ +75 | 54-294 -53 | 
53°0641°50 | 8°90+1:06 | 16°78+2:° 06) — 
HOLGoces Dou oer4te 259 | 1135-278 49-1b-e -4'8 
4606+ -91] 8°43+ :64 | 18°31+1°44 —- 
4594-9253 | bist “37 | TE63 782 | 49°19 “64 
aaNet (Don obese “30 | 12502 486 43°304. «71 
3612+ ‘96 | 6564 ‘68 | 18°15+1°95 | 39°92+1:°10 
35°96+1°80 | 9°6141°27 | 26°74+43°78 | 34°39+1°82 


greater the 


PrEMALE 
Standard 
Deviation 
SED Oke Voit s 
500+ +34 
DO2ete 4D) 
Deh tes O0) 
T11L+ -78 | 
sya Sy) | 


* These values of ¢’ are calculated from the mean racial simotie index. 


is given for a few races as well as the variabilities for these selected races, 


directly and indirectly calculated means is ante: reasonable, 
+ Both sexes together. 


The following table* (XXIII) gives the 


simosis. 


Maxillary Nasal Angle ¢. Mean and Variability. 


| 
Coefficient of | 
Variation 


ae pile 7K) 
10: 17+ °69 
11: 434 93 


ID1GHEIIS | 
17°82+2°01 
23°60 43°96 


| 


In Table XXIV 8 the 
mean value of ¢' as obtained from averaging the simotic angles of the individual members of the race 


The agreement of the 
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interracial values of ¢’. 


interesting points arise as we shall show in the next section. 
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Tt will be seen of course that the order must be the same 
as that of the simotic index, but if we compare this table with Table X XI some 


Tables XXIV A 


and XXIV B give the means and variabilities of the maxillary and simotic angles 


gd and ¢’. 


TABLE XXIV es. 


Simotic Nasal Angle 6’. Mean and Variability. 


| Egyptians ae 
| Hylobates (4d +9) 
English wee 
| Javanese ... 
/Gaboon, 1864 

| Congo 

| Gorillas 
Chimpanzees 


41°25+ 
O75 + 
44-7O+ 
33°23) 
SOs 
2662+ ° 
46°48 + 1° 
22 tif td 


Mann FrMALE 
Standard — Coefficient of Mean Standard Coefficient of 
Deviation Variation Deviation Variation 
| 

4:°70+ -32)11-40+ -78) 35-974 +57] 6:°02+ -41 | 16°73+1°16 
497+ 59) 50°99+7°50] = — —- 
658+ 44] 14°734+1-01 | 42°58+ -47) 4:°96+ °33] 11°65+ -80 
815+ 62 | 24°54+1°98 = — — 
734+ ‘51 | 23-5641°73 | 28-464 85 | 7°42+ ‘60 | 26°0842°24 
TOS+ 52) 28°8642°10 | 26°484+1:°20) 891+ +85 | 33°634+3°55 
10°30 41°07 | 22°1642°42 | 45°63+1°74 | 11 934+1°93 | 24°6142°85 
ceileets 32°1044°66 | 20°0041°73 | 7°7041°22 | 38°51 46:97 
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ft would appear from Tables XXI and XXIIL that ¢’ has a greater interracial range and 


from Tables XXIV a and B that it has a greater intraracial variability than @¢. 


What we need 


for a good racial character is a great interracial range and a small intraracial variability. We 
cannot therefore say that ¢’ is a better racial character than @. 


(11) 


We need some new terms. 


General Classification of Nasal Bridges by 6 and $'*. 


We propose to term races which have high values 


or low values of both @ and $’ homoclinic, while those which have low values of 


one and high of the other are heteroclinic. 
heteroclinic, and Hylobates is the same, but in the opposite direction. 


Among apes the Gorilla is markedly 


The 


Chimpanzee and Symphalangus are both homoclinie and so are the English, 


Egyptians, Hindoos and Veddahs. 


ithout regard to the exact numerical values and as merely descriptive terms 
Without regard to the exact numerical val d ly descriptive term 


the following types of Gephyrosis will indicate the nature of the characterisation, 
where it must be remembered that the reader has to fix his attention on the 


* Professor G, D. Thane has kindly made the following suggestions for a more complete angle 
nomenclature of the nasal bridge (see Fig. (i), p. 420): 


Lateral simotie 


Mesial simotic angle 


angle 


=our simotic nasal angle ¢’. 


=our 90° — ¢’=angle between simotic chord and median plane. 


Simotic angle (median) = our 180° — 2¢’=internal angle between two simotic chords: 


Lateral maxillary angle=our @¢=maxillary nasal angle. 


Maxilli-simotic angle 


=our 180°- $+ ¢’=internal angle between simotic and maxillary chords. 


The most important point here is the maxilli-simotic-angle, the approach of which to 180°, i.e. the 
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lower parts of the “bridge” from its ‘“springings” at the dacrya to the strokes 


marking the naso-maxillary sutures. 
14 23 34 22 
Hypoleptogephyreal 


27 
Hyperplatygephyreal Platygephyreal 
(approaching 


Mesogephyreat Leptogephyreal 
Orthogephyreal) 


Fic. (ii). Types of Gephyrosis. 


Similarly we have for the part of the “bridge” from “haunch” to “ haunch,” 
ie. the nasal bones, or part between the strokes marking the naso-maxillary 
sutures, the following characterisation : 


Hypersimotic Simotic Mesotie Stenotic Hypostenotic 


Fic. (iii). 


Types of Simosis. 


For classificatory purposes it may be useful to divide @ into three classes : 
when ¢ is under 46°, the group shall be called platygephyreal, when @ lies 
between 46° and 51°, the group is mesogephyreal, and when ¢ is over 51°, the 
equality of @ and ¢’, marks the ‘ triangularity” of the bridge, or the unbroken character of the nasal 


wall. 
Maxitli-simotic angle 180°-oh+¢q". 


| 
3 | f 

| 
Gorillas OBES? | Gorillas 188°7° | 
English 174°7° English 13 72 
Ainos 173-3 —- 
Hindoos 169°8° Borneo 172°8 
Malays 169 °3° — = 
Chimpanzees  168°4° Hindoos 169°6° 
Javanese 168°1° Javanese 168-4° 
Veddahs 167°3° Congo 164:7° 
Egyptians 167°3° Chimpanzees —162°5° 
Borneo 166°7° Egyptians 162-2 
Philippines 166°7° Philippines 161:1° 
Gaboon (1864) 166°0° Gaboon (1864) 160°5 
Gaboon (1880) 164-9° Veddahs 160°3° 
Celebes 163°8° —- — 
Congo 163°0° Nubians 157°0' 
Nubians 163°0° Gaboon (1880) 154°3' 
Moluccas 161°0° — — 
Sumatra 159°0° — — 
Symphalangus 149°2° Symphalangus 149°2° 
Hylobates 137°3° | Hylobates 137°3° 


Only the Gorillas have a re-entrant racial maxilli-simotic angle, although such angles occur in individual 
cases of other races. Of the races considered the Knglish appear to have most nearly an unbroken 


nasal wall, The racial order is not clearly marked as Negroes and Asiatics are much intermixed. 
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TYPES OF THE NASAL BRIDGE. 


yo aX ten a. 
1 2 1 bis 2 bis 


Dog. Nasal Bridge. Dog. Muzzle Section. 


1—2 bis=Hypersimotic Platygephyrosis. 


(i ea 


3 4 5 
Chimpanzee. Chimpanzee. Symphalangus. 
6 7 8 9 
Congo Negroes. Gaboon Negroes. 


3—9=Simotic Platygephyrosis. 


Malay. Malay. Malay. 
10—12=Mesotic Platygephyrosis. 


aS a 


13 14 
Gorilla. Gorilla. 
13—14=Stenotie Platygephyrosis. 


It has not always been possible to select a nasal bridge with the mean values of both ¢ and ¢’, and the 


typical form will hardly be realised even from two or three cases, 
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TYPES OF THE NASAL BRIDGE. 


a a 


15 16 17 18 
Hylobates. Hylobates. Hylobates. Orang-utan. 
15=Hypersimotic Mesogephyrosis. 16—17 = Simotie Leptogephyrosis. 18=Simotic Leptogephyrosis. 


7 ian a 


Natives of Moluccas. Natives of Sumatra. 
19—20=Simotic Mesogephyrosis. 21—22=Simotie Leptogephyrosis. 
23 24 25 26 
Aino. Aino (Variant). Nubian. Nubian (Variant). 
23—24=Mesotic Mesogephyrosis. 25—26=Mesotic Leptogephyrosis. 
Ze 29 31 33 
English. Veddah. Egyptian. Hindoo. 
28 30 32 34 
English. Veddah. Egyptian. Hindoo. 
27—28=Stenotic Mesogephyrosis. 29—34=Stenotic Leptogephyrosis. 


It has not always been possible to select a nasal bridge with the mean values of both ¢ and ¢’, and the 
typical form will hardly be realised even from two or three cases. 


Biometrika 1x 55 
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group is leptogephyreal. In the same way the simotic nasal angle ¢’ may be 
classed as s¢motic when under 32°, mesotic between 32° and 41°, and _ stenotic 
when over 41°. These limits should all be reduced by about 2° in classifying 
the female simotic nasal angle, which differs sexually by at least this amount 
from the male. We have accordingly the following table, of which we give illus- 
trations to indicate the type. The diagonal from simotic platygephyrosis to 


TABLE XXYV. 

Value of Maxillary Nasal Angle ¢. 
=| | Under 46° | 46° to b1° Over 51° 
2 -_———} So OAS | ea 
ee | | Simotic Simotic Simotic 
<q | Under | Platygephyrosis Mesogephyrosis Leptogephyrosis 
= | 32° e.g. Chimpanzees e.g. Natives of | eg. Hylobates and 
a and Negroes Moluccas | natives of Sumatra 
7c ee : a 
ea | Mesotic Mesotic Mesotic 
o 32° to 41° | Platygephyrosis | Mesogephyrosis | Leptogephyrosis 
= e.g. Malays e.g. Ainos e.g. Nubians 
7p) 
fe Ss 7 nie ee ews |e i = 
. Stenotic 
i) Stenotic Stenotic A . 
= 0 : , Leptogephyrosis 
ta yer 4i aby yLOSIS Mes "OSIS or. 
a Over 4 | 12h pe ee Mesoeenhonis e.g, Egyptians and — 
| e.g. Gorillas e.g. English | Tindoos 


stenotic plagygephyrosis marks the homoclinic groups. The simotic leptogephy- 
rosis of the smaller gibbon and the stenotic platygephyrosis of the gorilla mark 
the heteroclinic extremes, which weaken the interracial correlation of ¢ and ¢’. 
Table XXVI gives the values of ryy. 


With the exception of the Gorilla, which is as in the case of the cuneal indices 
(p. 424) again anomalous, we see that there is intraracially a small negative corre- 
lation between flatness of the maxillary and flatness of the nasal bones. But 
imterracially it is otherwise; there is quite a moderate degree of positive correla- 
tion and a flat nasal part of the bridge is found in the same races as a flat 
maxillary part. This is another instance cf the reversal of intraracial by inter- 
racial correlation. 


(12) Discussion of Projections of Nasal and Maaillary Portions of Nose on 
the Base of the Nasal Bridge. 


Having studied the relation of the subtenses, the indices and the angles, we now 
turn to the chords. The mesodacryal chord is positively correlated with the simotic 
chord for all races except male Chimpanzees and the mixed data for Hylobates, but 
in both these cases the correlations are insignificant having regard to their probable 
errors; see Table XXII, p. 422. Both interracially and intraracially a broad maxillary 
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TABLE XXVI. 
Correlation of Maaillary and Simotic Nasal Angles 


gd and ¢’. 
| 
3 2 
eae =e ead | < = 
Congo ... os a. — ‘037 + 095 — 127+ °133 
Gaboon, 1864 ne — 338 + ‘087 — '095+°113 
Javanese Ne ae —°054+4:°108 | — | 
English Pr of — +169 + 093 —*005+:095 | 
Egyptians ... ~... | +°0534°095 | -—-183+4-092 | 
le nual. = A We = 
| 
Gorillas ne aes + °447+°118 = lyf eile 
Chimpanzees... ie — 329+ °167 — ‘947+ °211 
[Hylobates ($+?) ... | --0584°168 | —-058+-168] 
Intraracial Mean a — 061 | —'146 
pense es oP Bs! | i 
Interracial Value* ... +°3534°139 | +4:°220+°178 


portion of the nose would thus signify a broad simotic portion. But the whole 
result is, as that for the subtenses, spurious and arises from the simotic portion 
recurring in the mesodacryal chord. If we subtract the simotic chord from the 
mesodacryal then the difference (DC — SC) is negatively correlated with the simotic 
chord SC, and the correlation is fairly substantial both interracially and intraraci- 
ally. In other words, whenever a race or an individual is found for which the 
nasal bones have a big projection on the base of the nasal bridge, then the 
maxillary bones have a small projection, and vice versd. Thus individually and 
racially a principle of compensation appears at work. Now we have seen that 
intraracially @ is negatively correlated with ¢’, or a steep maxillary portion is 
associated in the individual with a flat nasal bone portion of the bridge. Thus 
intraracially it is conceivable that even if the breadth of the nasal bone and the 
breadth of the maxillary wall of the nose were positively correlated, the projections 
might well be of opposite signs. But we cannot use this explanation to account 
for the negative interracial correlation of the projections, for here the maxillary 
and simotic nasal angles are positively correlated. In the case of intraracial 
correlations not only are the projections 4SC and 4(DC — SC) negatively correlated 
but also the subtenses SS and (DS—SS). Thus it is not improbable that in the 
individual a small nasal bone is associated with a large maxillary wall of the 
nose—a feature we may speak of as the Orang-utan character. But this does 
not hold interracially, for the subtenses SS and (DS— SS) are positively correlated 
(Table XXII) and substantially so for both sexes. It is of some interest to con- 
sider whether interracially the nasal bone and the maxillary wall of the nose are 


* Hylobates excluded, 
55—2 
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positively correlated. In order to test this point a little more definitely, let us 
suppose SB to be the breadth of a nasal bone and DB the breadth of the maxillary 
wall at the bridge. <Approaimately : 


SB =V1(S8C + (SS)! =(SS8)/sin 4’, 
’ DB = 1(D0— 8O¥ + (DS— SS)? = (DS — SS)/sin $. 


Table XXVIT gives the values of SB and DB found for each race from the 
means of that race for SC, SS, DC and DS. Of course the nasal bone and the 


TABLE XXVII. 


Calculated* Values of SB, the breadth of either nasal bone at the bridge 
and DB, the breadth of the maxillary portion of the nasal wall at the 


bridge. 
3 2 
Race See “= a 
SB DB SB DB 

Congo os wee 5°44 9°83 Oso 9°21 
Gaboon, 1864 see oe 5°48 9°82 5°53 9°53 

Jaboon, 1880 re ike 4°59 10°54 5:12 9°71 
Philippines aoe a 4-91 8°36 514 8:06 
Moluccas... ah ae 5:28 9:12 -— — 
Borneo Ne ae ates 4°51 | 9°33 4°54 8:09 
Malays ae se ss 4°69 9°40 — — 
Celebes 5:05 9°34 —— = 
Sumatra byerAll 8°94 — —- 
Javanese DES 9°23 511 7°64 
Nubians 6°64 10°22 6°60 8°60 
Ainos 5°80 8°96 — — 
Hindoos 5°67 8°73 4°91 8°32 
English 6°56 9°56 6°10 8°77 
Egyptians ... 6°97 9°49 6°67 8°39 
Veddahs 5:98 9°29 5°19 9:17 
Chimpanzees 4°52 6°41 4:24 5:91 
Gorillas 4°41 9°97 3°81 9°61 
Orang-utans = [13-57] — [11:02] 
Hylobates ($+ ‘a 2°70 4:27 2°70 4:27 
Symphalangus (¢ + ON 3°11 4:86 3:11 | 4:86 


maxillary wall are often, as our contours (pp. 428—9) show, considerably curved, 
but the chord of these arcs will indicate, at any rate approximately, what is the 
nature of the interracial correlations. Omitting Hylobates and Symphalangus as 
unsexed and the Orangs we find : 


Interracial correlation of nasal bone and maxillary wall 


Males +°'238, Females +215. 


* These values are really the hypotenuses or chords corresponding to the nasal and maxillary 
portions of the bridge of the nose, and these portions are often curved. 
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It appears therefore reasonable to assume that racially, where the nasal portion 
is large, so also is the maxillary portion of the nose. Notwithstanding this and the 
positive correlation of the simotic and maxillary nasal angles, as well as of the 
subtenses, the projections of the nasal and maxillary portions on the base of the 
nasal bridge are negatively correlated. The algebraic explanation of this paradox 
is given below*, but it seems probable that its physiological explanation hes in the 
relative constancy of the mesodacryal chord. If the reader will examine Table XIV 
he will see that the variability of the simotic chord relative to its size is almost 
twice that of the mesodacryal chord, and although some of this may be due to the 
relatively greater difficulty of accurate measurement the bulk of it is not. The 
following results will indicate the relative stability of the mesodacryal chord. 


They are deduced from Tables VIII and XIII, males. 


Mesodaecryal Simotic 
Chord Chord 
Interracial Mean 21°6 8:47 
Mean racial s. p. 2°44 1:97 
Interracial s. D. Ie3i7/ 1:20 
é Mean racial s. D. 
to: 11°3 23:3 
* Interracial Mean 
Interracial s. D 
100 : - 6°34 14:13 
ee Interracial Mean 
Oi ee Soe 23°5 58:9 


Interracial Mean 


Whichever method we take to measure the variation we see that the meso- 
dacryal chord is far less variable than the simotic chord. The breadth of the base 
of the nasal bridge is relatively constant. The nasal bridge has to span the nasal 
base, and whether we deal with the problem from the interracial or intraracial 
standpoint we realise that the nasal structure has to be considered as a whole, and 
that its anatomical units are very far from being necessarily evolutionary units, or in 


* The problem is algebraically of the following kind: given two right-angled triangles ABC and 
A’'B'C’ with C and C’ for right angles, then with the usual notation c?=a?+b?, c’%=a’2+b”%, is it 
possible for c and c’, a anda’ and A and 4’ to be positively correlated together and yet b and b’ to be 
negatively correlated? Clearly if the means be denoted in the usual manner, we have approximately : 


bb =¢ de — ada, b' 6b’ =c' bc’ — ada’. 
Hence multiplying together, summing for all possible pairs and dividing by their number, we have 


DY! yoy yp! HEC FoF el Veet HAL CyFy Maal CU FoF WN eg! — CAT Y Fal e'as 
Hence if 7,,, and r,,, are positive as the hypothesis supposes, and at least one or both 7,,, and ry, are 
positive also, the negative term or terms on the right may exceed the positive and 7, be negative. In 
the case of the simotic and maxillary nasal triangles both 7,,, and 7, are positive, and the last pair of 
terms on the right is wholly negative and in excess of the first pair. The above result depends of course 
on c2?=a2+l2 and ¢2=a? +b” being nearly true. These give ¢2=84-83 against actual 84°64 and c’2= 29-68 
against 29-16. These are close enough to justify the use of the above formula, which on putting in the 
actual numbers gave 7, the correlation of the projections negative, as the direct investigation gave it. 
Thus the origin of the negative value for the projections’ correlation lies in the negative terms 
involving the cross-correlations of nasal bone with maxillary height and simotic subtense with maxillary 
wall, which are themselves positive. 
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themselves primarily significant for interracial comparison or evolutionary history. 
The intimate correlations of anatomical units renders them also very frequently 
of small importance as hereditary characters, and a complex of anatomical units 
may be much closer to an evolutionary unit,—that entity which as a whole is the 
subject of selective action. For respiratory efficiency the mesodacryal are may be 
of far more importance than either of its components, the maxillary or nasal bone 
portions. Thus we think it probable that the value of anatomical units in cranio- 
metric measurements is liable to be overrated, and in view of their complex and 
subtle intercorrelations their identification with independent “Mendelian units ”— 
whatever the latter may signify—is unjustifiable. 


In the series of cuts on pp. 428—9, we have endeavoured to give “ type ” contours 
of the nasal bridge for various races on the basis of the classification adopted in 
Table XXV. The reader is warned that in any race individuals may be found 
with many of these types. Our “types” are those of individuals whose simotic and 
maxillary nasal angles, $’ and ¢, are as near as we could get to the mean values of 
those angles in the race. Not too much weight must of course be placed on the 
classification of races thus reached, but the diagrams are not without suggestive- 
ness. We have added the Dog with its negative simotic subtense, a condition 
which is just reached in one or two Gibbons. In the case of the dog, however, the 
dacryon to dacryon section seems to miss the physiognomic nasal bridge, and we 
have given also a lower section termed on p. 428 the muzzle section. The short 
strokes mark for each contour the limits of the nasal bone; the contours are in 
each case from dacryon to dacryon, and were traced after orientating the skull so 
that the two dacrya and the summit of the bridge were in one horizontal plane. 
Obviously larger numbers of many more races would provide a more adequate 
classification, especially if extended into the animal kingdom generally*. 


* The Gibraltar skull is the only ‘‘ancient type” of man for which we have been able to obtain the 
nasal bridge constants. They are: 


Mesodacryal Subtense 8-6, Simotic Subtense 2°8 
a Cherd 220? 29 Chord  10°8 
99 Arc 31:0? 


These give 

a=39°l, a’ = 44-4, B=40°9, S=25°9, p=46°0, gp’ =27° 5, 
the nasal bridge is thus on the border of the simotic platygephyreal and simotic mesogephyreal groups. 
The difficulty, however, of determining the dacrya in the absence of the sutures in the cast used is very 


Gibraltar Skull. 


considerable. The contour of the nasal bridge is shown above. It will be seen to be in no way 
markedly differentiated. If we accept. it as approximating to that of the original, that original would 
have stood somewhere between the Negroes and the natives of the Moluccas for both the maxillary 
and simotic nasal angles, i.e. at the Chimpanzee and Symphalangus end of the scales. But the 
evidence is too fragile to lay any stress on this point. 
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(13) Concluding Remarks. We are aware how much of the present paper is 
open to criticism on account of the limited number of the racial series dealt with 
and the sparseness of the individual crania in those series. Notwithstanding we 
hope that the reader may find suggestive paths leading off from these preliminary 
researches. We believe that the time must come when in the larger Universities 
there will exist established schools of craniometry with adequate anatomical, 
physical and statistical training, and that such schools will carry on secular work 
in collecting and measuring. We have indicated in this paper that the study of 
the personal equation of craniometricians has hardly begun, but we look forward 
to the day when it will be as customary for one of the workers in an established 
craniometry laboratory to be adopted as a standard and his personal equation 
relative to other workers at home and abroad discussed, as it is for one astro- 
nomical observer to be tested against a second. The intercomparability of 
measurements is largely taken on blind trust by craniologists to-day. Only when 
relative personal equation has been studied will it be possible safely to pool the 
measurements of small series made by different laboratories. 

Again, anatomical and zoological museums and collections are singularly 
defective at the present time. If it were desirable to make a thorough study of 
the cranium of any animal other than man—say dog or horse—is there any collec- 
tion which could place at the disposal of the inquirer a hundred crania of definite 
sex, fairly uniform age and reasonable limits of race? We sadly fear not, and yet 
how much could be learnt of evolutionary descent by an exact study of a variety 
of species based on even a hundred crania from each! Take the plane of the 
foramen magnum, determined say by a plane through basion and opisthion perpen- 
dicular to the sagittal plane, and consider the angle this makes with the horizontal 


plane of the skull, as determined, say, by the Frankfurt Concordat. This angle 


—closely allied to Dauberton’s angle—is of very great evolutionary importance, but 
who can say—on the basis of really numerically adequate measurements* and other 
than from mere impression gained on a few specimens—what its value is in the chief 
mammals, in the anthrepoid apes, in negroes and in the various other races of man ? 
Comparative craniometry is almost at its origins if we refer, not to descriptions of 
“typical” + crania, but to the measurement of numerically adequate series. Not until 


. investigations, only roughly foreshadowed in such a paper as the present, have been 


made on a far greater variety of species and on a wide range of cranial characters— 
and this will have to wait until adequate material accumulates in our museums— 
would any suggestion of evolutionary descent pass into comparative certainty. 
When the reader realises that some 25 gorilla skulls of one sex, but of all ages 


* We do not wish for a moment to underrate Broca’s work in this field; but besides his views as to 
the horizontal planes, his series are often unsexed, and, especially for the chief mammals, wholly 
inadequate numerically. ‘Thus he uses two or three apes, dogs, cats, and so forth, where we need 50 to 
100 of each breed and sex. 

+ How few anatomists realise that nobody knows what a ‘‘ typical” cranium is until a long series of 
the particular race has been measured! We remember seeing in one museum a skull, labelled ‘‘ typical 
English,” and this at a date when not 50 English crania from one district and period had been 
measured, 
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and all districts, are about all that the English worker has to rely upon at 
present, and that it may be doubted whether he could find the same number of 
dog skulls of known sex, of one breed and of fairly certain age, anywhere in 
the country, he will, we hope, pardon one source of the inadequacy of the present 
study. 


In choosing the nasal bridge we had in view the importance of the nasal 
organs as a factor of survival, and we believe we have at least indicated that with 
caution the parts of the bridge can be measured and comparative results obtained. 
The measurements lead directly, we think, to the suggestion of a racial scale and 
throw light on the importance of dealing with groups of correlated characters 
and not with isolated anatomical units. The classification we have suggested, 
both in the limits of its categories and its terminology, is of course not put 
forward as final, but we believe it will be helpful as a step to a good ultimate 
classification and as enabling meanwhile the characters of the nasal bridge of any 
other race to be associated with those treated in this paper. The needful measure- 
ments are simotic chord and simotic subtense, and again mesodacryal chord and 
subtense; from the first two the simotic nasal angle can be readily found and 
from all four the maxillary nasal angle. The two taken in conjunction will enable 
the recorder to classify the nose by aid of Table XXV. 

On the whole we consider that Mérejkowsky broke fruitful ground, especially, 
if the discussion of the nasal bridge be extended to the mesodacryal chord and 
subtense, which appear of more evolutionary importance than his simotic chord 
and subtense ; but there will undoubtedly have to be standardisation of individual 
craniometricians in a far higher degree than even for the macrometric measure- 
ments of the skull, if these micrometric measurements are to be dealt with con- 


fidently. 


Addendum. Since this paper was written, it occurred to us that the Moriori 
crania from Chatham Island in the Royal College of Surgeons might be worth 
investigating from the standpoint of the nasal bridge—as they are undoubtedly 
worth investigating from their general craniometric importance. 


The nasal bridges, however, turned out to be of no markedly primitive type, 


as the accompanying contours will indicate. 


Moriori Nasal Contours. 
They are mesotic mesogephyreal, and stand between Malay and Aino with: 
3 f 
p 46-0° 49:0° 
Pp 40°6° 343° 


45 crania were taken as male, 15 as female. 
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There is a somewhat noteworthy difference between male and female, but a 
still more remarkable feature of the results is that for the whole eleven characters 
discussed the male is more variable and in some cases no less than twice as 
variable as the female. Such exaggerated differences in variation in the sexes 
are so unusual, especially in primitive races, that we believe that the noteworthy 
cranial characters of this race, in which “ maleness” seems almost a racial character, 
may have led to a far too stringent selection of female crania. Thus on the 
present sexing the female coefficient of variation of the mesodacryal subtense is 
8:0, of the mesodacryal index a, 8:0, and of the simotic index 14°4,—all values 
much below any reached for females of the human races given in Table XIV. 
We consider it better to postpone publication of the Moriori results, until a 
fuller study has been made of all the craniological characters of the Royal College 
series, which we hope may soon be undertaken. 


Tables A—D give the measurements made for the apes only. The tape 
measurements of the mesodacryal are scarcely ever permitted of the determination 
to tenths of a millimetre, and this makes the tenths in the mesodacryal chord of 
small value, when we are determining the arcual index 8. In the tables doubt 
as to a value is only expressed in the columns of the absolute measurements and 
- not repeated in the index columns deduced from these absolute measurements. 
Of course the index values have no more weight than the data from which they 
are deduced. 


Note. The Editor is responsible for the terminology used in this paper. He 
fully admits its barbarism. But no English form for the Greek adjective exists, 
and when a distinguished classical scholar fails to find one, science must take the 
matter into its own hands, and adjectives in -otic seem the only possibility. 
Simotic has practically Greek sanction; the pathologists have already adopted 
stenotic; thus the step to mesotic was not a very great one, and it is shorter 
than mesomorphic or mesorhinal,—were indeed compounds in rhinal not already 
allocated. 
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TABLE A. Characters of the Nasal Bridge in the Gorilla. 
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Nasal Bridge in the Chimpanzee. 
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TABLE B. Characters of the Nasal Bridge in the Chimpanzee. 
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1 | 18 = a az, = ores iM oct Young. Third molar and canines erupting 
2049" io} if = = = |Paroke Museum seen tommaxillary suture obliterated 
979 =: = ” ” . | Sutures clear 
4 3 te => | a . oo cee ten ... | Sutures obliterated 
- - Jt = = \ 
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3b a 13:8 4) eoie 10:08 ale 2255 93-2 —20°7 124°6 
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Locus 


} 
Remarks 


| Cambridge Zoological 


Royal College of Surgeons 


” ” 
University College, Zoology 
Natural History Museum ... 


Laboratory 


” 
” 
” 


Department 


| Sutures 


Split skull with ill-fitting halves 
Sutures very vague 
Sutures clear | 
Sutures clear 
Sutures clear 
clear 
clear 
vague. 


Sutures 
Sutures 
Sutures clear 

Young. Sutures clear. Third molars erupting 

Adult, but one third molar never erupted. Sutures clear 
Sutures clear 


Teeth misplaced 


| Sutures clear 


Lacrymo-maxillary suture not very clear 

Sutures clear [side 
No nasal bones. Maxillary bones meeting with ossicle on right 
Sutures fairly clear. Unusually broad nose for an orang 

Very aged. A large skull with broad nose and much deve- 

loped crests. Sutures vague 

Fragment only. Sutures clear 
In the Whale Room 


In the Whale Room 

Sutures clear 

Sutures clear 

Aged. Sutures rather vague 

Lacrymo-maxillary suture obliterated 

Lacrymo-maxillary suture obliterated 

Sutures clear. No nasal bones 

Aged. Sutures clear {much developed crest 

Aged. Sutures clear. Very long “spindle” nasal bone and 

A large skull with rather vague sutures 

Lacrymo-maxillary suture obliterated 

Sutures clear 

Sutures clear 

Sutures clear. Nasal bones very narrow 

Lacrymo-maxillary suture obliterated 

Sutures clear. Very wide nose [bones extremely narrow 

Broken orbits and therefore measurements uncertain. Nasal | 

Sutures clear. Very large skull 

Sutures clear 

Young adult. 

Young adult. 

Sutures clear 

Young adult. Sutures clear 

Lacrymo-maxillary suture absent, other sutures clear 

Young. Third molar and canine erupting. Split skull and 
therefore uncertain measurements. Believed to be large 
species. Omitted from statistical reductions 

Right lacrynio-maxillary suture obliterated, left placed high on 

Sutures clear [side of nose 

Young adult. Four upper molars on both sides and four lower 

Sutures clear [molars on left side 

Young. Third molar and canines erupting. Believed to be 
large species. Omitted from statistical reductions 

Sutures clear 

Sutures clear 

Sutures clear 

No nasal bones. 

Sutures clear 

Young adult. 

Young adult. 

Sutures clear. 


Sutures clear 
Sutures clear | 


Maxillary bones much developed 

i [incisors and canines 
Bicuspids on each side misplaced behind lateral 
Sutures clear 
2Ossicle at dacryon on left side 
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Nasal Bridge in the Orang-utan. 
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— = Se | = 2 ; Se ion a —— 
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| $ | 10° 93-4 ” ” » - sued cleave 
d 107 we : ... | Sutures clear 
| g | H 160 ” » » cee .. | Sutures clear 
| g Q ee} 2 ” ” No + | Sutures clear 
1" » » ” ox -.. | Sutures cle: | 
| ‘ 1 o » > on .. | Sutures c! | 
> ” » sce ... | Sutures y Teeth misplac 
| ‘ | i ” ” ” con ». | Sutures c ee 
| i ” ” wr on --- | Young. Sutures clear. Thir i 
z 2 ene s clear, hird molars erupting 
| . Boo 5 ut thir Y never er Sut 
; 2 z = Sipadule | ubiene hird molar never erupted. Sutures clear 
ae) » 0 -.. | Sutures clea 
University College, Zoology rte! matt | 
| j : Nuon Diss go 1 Zooloe) Department. Pacrimo masilary suture not very clear 
| | ae Sutures clear S 
1 
‘ p 2 D na Ronissal bones: Miasillany bones meeting with ossicle on ue 
Sutures fairly clear. Unusua roa se for an ora 
é ” 5) 066 a0 ... | Very aged. “IN large PROS : ‘a mack 
: lee loped crests. Sutures vague oe She mehiders 
} ? ” ” Fragment only. Sutures clear 
| | z D » In the Whale Room 
, ” 
0S ” ” ” 1 
| «| In the Whale Room 
| 3 ” ” ” - | Sutures clear 
| F % ” ” | Butures clear 
” ” «| Aged. Sutures rather vague 
i 2 ” ” Lacrymo-maxillary suture obliterated 
ee ” ” . | Lacrymo-maxillary sutu liter: 
is é Lacry 'y ire obliterated 
‘ ” » Sutures clear. No nasal bones 
¢ 2 » ” Aged. Sutures clear {much developed crest | 
| 4 z Y ” Aged. Sutures clear. Very long “spindle” nasal bone end 
5 a , » A large skull with rather yague sutures ‘ 
3 | » ” Lacrymo-maxillary suture obliterated 
é | z ” ” Sutures clear 
| ) ” » Sutures clear | 
| 4 2 y ” + | Sutures clear. Nasal bones very narrow 
| é | Y oD ” oc pactyand. maxillary suture obliterated 
a AS ted ooo +» | Sutures clear. Very wide nose bones extn arn 
| 4 Broken orbits a therefore Snes a Teall ] 
| 3 tures clear. Very large skull Gio. 2 
4 Sutures clear Pape 
; Young adult. Sutures clear 
3 1313 Young agule Sutures clear 
is 1123 | 3 uitures clear 
: » 7) a Young adult. Sutur r 
: 190-0 gs . Sutures clear 
8 nos x z » ” Vactyme maxillary suture absent, other sutures clear 
” » oung. Third molar and canine erupting. Split skull and 
| | | therefore uncertain measurements. Believed to be large 
anes fal - For Gambridsal Zaclen cele netees re mt species: Omitted from statistical reductions 5 
fe 4 | 7 a | e ; y Higheleerymoomalary suture obliterated, left placed high on 
; | 17 = M49 % a clear [side of nose 
i > | 4 | ry = 208 2 a i Moung edits Four upper molars on both sides and four lower 
p st =189 ‘ ear molars on left si 
| j ” » ” + | Young. Third molar and canines Brae palievedt mae 
, ; 3 | ae Sees sbentes: Omitted from statistical reductions 
nee 00 — 20°3 o 
; | ‘ “ Sutures clea: 
ey | 10°3 —19°3 : 
2 d 103 19: Sutures clear 
: all ~149 \ 
| aa; a 2 | 84 —28'1 No nasal bones. Maxillary bones much deyeloped 
2048 | 3 nO — 27-0 Sutures pleas ae _ __[ineisors and canines 
3 eh - is ae adult, Bicuseis ip on(each side misplaced behind lateral 
: f | Se 2 ” S : i 
| 3 | | [ | | ; f » ... | Sutures clear. ?Ossicle at dacryon on left side 
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TABLE D. Characters of the Nasal Bridge 


Meso- | Meso- 

dacryal | dacryal 
Are Subtense | 
20 4°3 
16 |} 35 
10 lemme le 2 
17, 3°9 
20 4°3 
22 3°9 
17'5 | 3°D 
23°82?) 5:97 | 
20:5 eee 
15 ay} 
15 4°] 
1 | 4°3 
16°5 4 3°6 
13°5 2°8 
142 OR aoc 
16°5 4-2 
20 D7 
22 2 5:4 
1575 117/ 
17°3 3°9 
17°4 ? 46? 
13 4 4°3 
iy 4:0 
18? 4°4 
18 3°1 
16 3°5 
16°6 3°0 
20 3°7 
19 4°5 
16 eee 7h 
ee ha 
12 Q°4 
Tay |), * 37 
15°3 45 


Mesodacryal Indices 


ow et 
10 © 
oO oc 


Simotic 
Chord 
a’ a—a’ B 

SUP. SCR Biles = 
47-4 | —15°6 45°5 5°5 
21°2 79 111 4°3 
46°5 —13°4 44:1 6°6 
39°2 -—10°5 33°3 5°4 
43°3 —18°6 39°2 — 
43°8 — 15°38 40°0 —_— 
49°5° | =12°6°| 48-8 | Peston 
46°7 | —15°0 44-4 6-2 
OOs3a0s |e —wlefeco: 50°0 5°6 
52°2 —10°4 53°1 5°5 
512 — 7:8 51°5 4°3 
50°3 —17°6 50:0 5°82? 
42°3 —13°7 37°8 6:0 
55°1 —15°1 57°83 4:0 
46°9 —10°1 44°8 4:02 
80°8 23°2 102°0 6:0 
62°0 — 20°5 69°2 — 
36°3 —22°1 29-2 6-0 
48°1 — 15:0 46°6 5:3 
60°8 — 16°6 67°3 — 
61°3 —19°6 68°0 4:31 
69°9 —26°9 82°8 57? 
58°6 —18°6 63°6 — 
61°4 —32°4 68°2 5:0? 
44°8 —13'8 41°6 6:0 
47°5 —21°2 45°6 5:2 
54°2 — 25°3 56°3 — 
64°0 -— 23°1 72°7 — 
67°0 —25°9 77°8 — 
64°5 —22°6 73°5 — 
39°2 —12°5 33°3 5°9 
55:9 —22°3 59°] _- 
62°4 —12°4 70°0 6°6 


Simotic 
Subtense 


—~ 


Oro wR Oo Ot OO WF 
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| 
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oles 


elirerrconteal 


eels | | 


The symbol @ denotes sex unknown. 
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in the Gibbons, Hylobates and Symphalanqus. 


andes, Locus Remarks | 
= Natural History Museum | Hylobates syndactylus. From Pahang. Sutures obliterated 
16°4 op fe) > | From Sumatra. Very young 
7:0 ‘s . FF Symphalangus. From Sumatra. Young adult 
12°1 ® 7 Symphalangus. From Sumatra. Asymmetric lacrymal bones | 
3°7 9 » 5 Symphalangus syndactylus. From Sumatra. Old. Sutures 
vague 
-— * 5 5 Symphalangus syndactylus. Sutures vague. Nasals absent _ 
at the bridge 
= 3 ‘ 5p Symphalangus syndactylus. From Pahang. Very old. Sutures 
quite obliterated 
8:8 | Royal College of Surgeons | Symphalangus. Old. Sutures obliterated 
129 99 6 Symphalangus. Old. Sutures rather vague 
8:9 *5 5 i Hylobates sp. 
5°5 = os _ Hylobates sp. Young 
9°3 es $6 .; Hylobates lar. Adult 
52 3 . . Hylobates sp. Old. Sutures vague, nasal sutures obliterated 
6:7 . 7 * Hylobates sp. Young 
12°5 ss * - Hylobates sp. Very young [vated | 
12°5 | Natural History Museum | Hylobates leuciscus from Malacca. Very old. Sutures oblite- | 
16°7 . 5% ” Hylobates hainanus from Hainan. Young. Peculiar lacry- 
mal bones [rated 
— 5 . PY Hylobates gabriella from Annam. Old. Sutures quite oblite- 
-1:7 * Ph - Hylobates mulleri from N. Borneo. Nasal sutures vague 
119) A ms % | Hylobates agilis 
— 55 is 7, | Hylobates agilis from Sumatra. Sutures obliterated 
4-7 o ss “ Hylobates leuciscus from Java. Sutures obliterated 
18 | Hylobates hoolock. Young. Apparently no nasals, but pro- 


| 


— 
oWF 
DwWO 


[sal 


cess of frontal 


Hylobates lar. 
Hylobates lar. 


Hylobates leuciscus from Borneo 
| Hylobates lar. 


Hylobates lar. 
Hylobates sp. 


Hylobates lar. 
Hylobates lar. 
Hylobates lar. 
Hylobates lar. 
Hylobates sp. 


Sutures obliterated 
Sutures obliterated 


From Malay Peninsula. 
From Malay Peninsula. 


From Malay Peninsula 
From Malacca. Sutures obliterated 
Ridge on maxilla. Nasal sutures obliterated 


Sutures obliterated 
Nasals broken 


From Malacca. 
From Malacca. 
Young 
Sutures obliterated, bisected skull 
Very young 
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Museum 
Number 


1006 a 


0.10.1. 8 
1031 081216, 
79. 8. 50. 1 
10. 10. 1.7 
54. 10,5. 1 
{1006 a 


1095 a 
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Or. OF BAKO, 


CEOSOSOEOS OKO HOOE OAH OK OQKOOFOK OS = DAO 4004+ 0¢ 0510050, 


TABLE 
| Mesodacryal Indices 
Meso- Meso- Meso- | 
dacryal | dacryal | dacryal | 
Chord | Arc |Subtense) 
13-2 20 
11:0 | 16 | 
90 10 
LBS |) 17 
15-0 20 
| 
| 15:8 | 92 39 
12°5 175 | 35 
| 
16°07 597 
| 14:2 455 | 
10-0 33) | 
98 41 
99° | 43° | 
110 36 
98 | 2:8 | 
9-0 36 
| 11-4 42 
9-9 57 | 
a4 
Le7e | 
| Bit) 
| 461 
43 
| 40 
11:07) 187 | 4:4 
10°7 18 31 
11°83 16 35 
} 114 16-6 ) 
12°8 20 37 
11:07} 19 4°5 
907 16 87 
} 98 7 | 4:2 
9:0 12 | 24 
11:0 75 37 
90 15°3 45 | 


The symbol g denotes sex unknown. 
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in the Gibbons, Hylobates and Symphalangus. 


Simotic Li 
Index ocue 
_ Natural History Museum 
16-4 i i ‘ 
70 ” » ” 
121 » » » 
37 ” » ” 
” ” ” 


nO U1 bo cD 
AN BASSHR 


on 


Dwg 
stack 
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| Hylobates hoolock. 


Remarks 


Hylobates syndactylus. 
From Sumatra, 
Symphalangus. [rom Sumatra. 
Symphalangus. From Sumatra. 
Symphalangus syndactylus. 
vague 
Symphalangus syndactylus. 
at the bridge 
Symphalangus syndactylus. From Pahang. Very old. Sutures 
quite obliterated 


From Pahang. 
Very young 


Sutures obliterated 


Young adult 
Asymmetric lacrymal bones 
From Sumatra. Old. Sutures 


Sutures vague. Nasuals absent 


Symphalangus. Old. Sutures obliterated 

Symphalangus. Old. Sutures rather vague 

Hylobates s; 

Hylobates Young 

Hylobates lar. Adult 

Hylobates sp. Old. Sutures vague, nasal sutures obliterated 
Hylobates sp. Young 

Hylobates sp. Very young [vated 


Hylobates leuciseus from Malacca. 
Hylobates hainanus from Hainan. 
mal bones 

Hylobates gabriella from Annam. 
Hylobates mulleri from N. Borneo. 
Hylobates agilis 

Hylobates agilis from Sumatra. 
Hylobates leuciscus from Jaya. 
Young. 


Very old. 
Young. 


Sutures oblite- 

Peculiar lacry- 

[vated 

Old. Sutures quite oblite- 
Nasal sutures vague 


Sutures obliterated 
Sutures obliterated 
Apparently no nasals, but pro- 


cess of frontal 

Hylobates lar. From Malay Peninsula. 
Hylobates lar. From Malay Peninsula. 
Hylobates leuciscus from Borneo 
Hylobates lar, From Malay Peninsula 
Hylobates lar. From Malacca. Sutures obliterated 
Hylobates sp. Ridge on maxilla. Nasal sutures obliterated 


Sutures obliterated 
Sutures obliterated 


Hylobates lar. 
Hylobates lar. 
Hylobates lar. 
Hylobates lar. 
Hylobates sp. 


From Malacca. 
From Malacca. 
Youn: 

Sutures obliterated, bisected skull 
Very young 


Sutures obliterated 
Nasals broken 
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ON THE CALCULATION OF INTRA-CLASS AND INTER- 
CLASS COEFFICIENTS OF CORRELATION FROM CLASS 
MOMENTS WHEN THE NUMBER OF POSSIBLE COM- 
BINATIONS IS LARGE. 


By J. ARTHUR HARRIS, Ph.D., Carnegie Institution of Washington, U.S.A. 


I. INTRODUCTORY REMARKS. 


Ir one designates as classes the groups of measurements or evaluations 
segregated out of the whole series under consideration, then those correlations 
which measure the similarity in character of the individuals of the same class 
may be designated as intra-class coefficients of correlation, while those which 
measure the resemblance between the individuals of different but (an some 
manner) related classes may be designated as inter-class coefficients of corre- 
lation. 


Intra-class and inter-class should not be confused with intra-racial and 
inter-racial. These latter terms seem to have been first used in a memoir on 
Naquada Crania by C. D. Fawcett* and discussed for the local races of the 
lesser celandine+ and in greater detail with anthropometric illustrations by 
Jacob, Lee, and Pearson{. Examples of the practical use of inter-racial 
correlation coefficients are also given by Tschepourkowsky§. By intra-racial 
correlation as the term is used in these discussions, one understands those 
coefficients expressing the interdependence between the measures on a series 
of individuals of the same race. Under inter-racial are included the correlations 
between the types (means, modes) of a series of races. 


In intra-class and inter-class correlation, on the other hand, the unit is always 
the individual (in the statistical sense), not the type of a series of individuals. 
The purpose of the correlation is to determine the resemblance for any series of 
characters of the individuals of sub-classes, or of an associated pair of sub-classes, 


* Biometrika, Vol. 1. pp. 460—461, 1902. 
+ Biometrika, Vol. 11. pp. 152—153, 1903. 
{ Biometrika, Vol. 11. pp. 347—356, 1903. 
§ Biometrika, Vol. 1v, pp. 161—168, 286—312, 1905. 
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as compared with random pairs from the population (or universe) which they 
constitute. Thus fraternal correlations are intra-class and also intra-racial, as 
are also homotypic coefficients. In one case, the “class” is fixed by ancestry 
and family environment, in the other, by ancestry and individual environment. 
Avuncular correlations—in which all nephews (nieces) of the descendant genera- 
tion are compared with all the uncles (aunts) of the ascendant generation—are 


illustrations of inter-class, but also intra-racial, correlation. 


The most familiar illustration of intra-class correlations are the fraternal and 
homotypic; practically the only inter-class relationships hitherto considered are 
the avuncular and the cousinship in heredity. But these correlations might be 
powerful research tools in morphology and physiology as well as in heredity 
and sociology. 


The reason for their limited use in the past is in part technical. When the 
number of observations is as large as is desirable, the formation and verification 
of the ordinary correlation table (where each individual of the V used is entered 
only once) is an irksome task, but when each measurement is compared with 
a number of others, the purely clerical labour involved in tabulation becomes 
onerous in the extreme. If n be the number of individuals in any class, the 
number of combinations within the class is }n(m—1), while if the tables be 
rendered symmetrical* as is generally desirable+, the number of combinations 
for each class is n(n—1), which gives for the m classes constituting the popu- 
lation S [n(n —1)] or, where n is constant, m [n(n —1)] entries in the intra-class 
correlation surface. With n as low as 20 and m only 250, this gives 95,000 
combinations. In inter-class correlation if p be the number in the first and 
q the number of observations in the second (associated) class, the permutations 
for a class are pq, and the entries for the population S(pq), or mpq if both 
p and q are invariable. Obviously, neither m, p, nor q need be large to give 
very heavy tables. 


The labour involved in constructing such tables and making sure that they 
are free from errors is very great indeed. To meet this difficulty in extreme 
cases, Pearson} has shown how formulae may be deduced for obtaining the 
coefficient of correlation between any grades of kindred from the means of 
arrays into which the kindred may be grouped. The method suggested here 
reduces the work to still simpler terms, obviating entirely the necessity for the 


* For discussions of the advantages of the symmetrical table see K. Pearson and others, Phil. 
Trans. Roy. Soc. Lond. A, Vol. cxevu. pp. 285—379, 1901; R. Pearl, Biometrika, Vol. v. pp. 249—251, 
1907; K. Pearson and A. Barrington, Eugenics Laboratory Memoirs, No. Y., 1909; J. Arthur Harris, 
Biometrika, Vol. vir. pp. 214—218, 1909. 

+ Jennings (American Naturalist, Vol. xtv. pp. 123—128, 1911) has suggested a method of calculating 
the symmetrical table constant without actually rendering the table symmetrical. I believe his method 
is not serviceable where more than a single pair of characters are involved. 

£ Pearson, K., Phil. Trans. Roy. Soc. Lond. A, Vol. cxcit. pp. 271—274, 1899. For other methods 
of dealing with problems involving many measurements see Biometrika, Vol. 11. pp. 69—71, 77—78, 
1902. 
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formation of tables, for the calculation of the means of classes, the standard 
deviation of classes, or the correlations between them, thus reducing the labour 
involved in the determination of intra-class or inter-class correlations to a small 
fraction of that hitherto required. While these short methods when properly 
applied give coefficients agreeing exactly with those deduced from tables, they 
cannot be recommended without two special warnings. (a) In the hands of 
inexperienced calculators abstract formulae are likely to result in blunders 
which would not occur, or at least could be easily detected, if the computations 
were made from tabled data. (b) The tables are, in some of the cases at 
least, desirable for other purposes, eg., the testing of linearity of regression. 


The underlying principle is very simple. The familiar product moment 
formula for rz, requires merely the first two moments of # and y around the 
means and the product moment & (a’y’). Of course, the rough moments may 
be taken around any origin whatever and referred to the mean by a proper 
formula. In many cases, there are material advantages* in taking 0 as the 
arbitrary origin. = (a’), X(y’) are then merely summations of the values of 
the individual variates; = (a), = (y®) are summations of their squares; ¥ (a’)/N 
is the mean, m and o,=V> (2) N—m,2. Then 


‘oy = a CAB! ey (i). 


oxy 


Now the calculation of the rough moments and the product moments about 
0 as origin for individual classes is a very easy task. Indeed, with the aid of 
a proper machine they may be rapidly obtained without seriating the measure- 
ments at allt. Since these rough moments for classes are taken about 0), they 
can be summed, weighted and summed, or subtracted in any way we please to 
obtain the rough moments for a correlation surface for the whole population. 
All of this is rapid machine work requiring but a moiety of time needed in 
the formation of tables of thousands of entries. Formula (i) is so well known 
that I shall do nothing further in the way of proof of the validity of the 
formulae suggested here than to show that the data from the first two moments 
and the numbers of individuals in the classes can be thrown into an equivalent 
form. 


A word may be said concerning the illustrations given. All are, I believe, 
of intrinsic biological interest, but they have been chosen primarily for their 
convenient shortness, which facilitates not only the publication of the raw data, 
but the checking through of the arithmetical work by the reader. Thus they 
do not always represent the biologically best series which might have been 
selected. 


* Harris, J. Arthur, ‘‘The Arithmetic of the Product Moment of Calculating: the Coefficient of 
Correlation,” American Naturalist, Vol. xxv. pp. 693—699, 1910. 

|+ After a little practice with the machine, one can sum the squares almost as readily as the first 
powers entered on the record sheets, providing the grades are not too high. 
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Il. SvTareMentr AND ILLUSTRATION OF PROBLEMS. 


Problem I. To form Direct or Cross Intra-class or Inter-class Correlation or 
Contingency Tables when the Number of Possible Combinations is Large. 


A method for the rapid formation of the complete direct intra-class table 
has been described and illustrated elsewhere *. The method of dealing with 
both direct and cross intra-class correlations by means of condensed tables has 
also been indicated+. Both direct and cross inter-class correlation tables (full 
or condensed) are formed by the same method, but more simply since the 
entries are S(pq), not S[n(m—1)], and the tables as first formed can be used 
without reduction. The method of dealing with fractional inter-class correlation 
tables will be illustrated as a check under Problem V. 


Problem IT. To Determine the Correlaticn between a Character and an 
Array of Associated Characters. 


This serves chiefly as a simple introduction to the more complex cases to be 
described presently. One wishes often to correlate between a first measure and 
an array of associated measures}. Tables may be difficult to form or cumber- 
some, while moments of the arrays of measurements around 0 as origin, & (y’) 
and = (y'), may be wanted for other purposes, or, if not, are at least quickly 
calculated. ‘Then, by the use of a machine for simultaneous multiplication and 
addition, we get for the rough moment coefficients 


SCE) — Te — Mp UNE) (WV) =e secant neeee (11), 
Seow — Is SY VIS (DV Secs nen ccenaees (iii), 

and for the rough product moment coefficients 
SE AIDS (A INESH (0) ie ae ieee ye Reeree oe one (iv), 


where n is the number of y measurements associated with any a, NW =S(n) is 
the number of measurements in all arrays or classes, } denotes a summation 
within the class, and 8S a summation of classes for the whole population. From 
this point the work is straightforward by (i), de. 


ne S fa’ [X(y’)]} N — memy 


OxTy 


The particular advantages of this method of calculating correlation are seen in 
cases where: (a) the class moments have already been calculated for other 


* Harris, J. Arthur, “On the Formation of Correlation and Contingency Tables when the Number 
of Combinations is large,’’ American Naturalist, Vol. xnv. pp. 566—571, 1911. 

+ Harris, J. Arthur, ‘‘The Formation of Condensed Correlation Tables when the Number of 
Combinations is large,” American Naturalist, Vol. xuv1. pp. 477—486, 1912. 

~ A single parent may produce a score or a hundred offspring. Or, one may desire to know the 
relationship between the weight of the seed planted and the weight of the seeds produced, or between 
the number of pods per plant and the number of ovules and seeds produced per pod. 
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purposes, (b) it is desirable to calculate several constants from the same 
material with modified grouping or weighting of some of the classes, (c) it is 
desired to obtain the correlation without seriating the measures for the classes 
at all*, 


Problem III. To Determine Direct and Cross Intra-class Coefficients of 
Correlation from the First Two Moments of the Individual Classes. 


Let w be the measure of a first and y the measure of a second character of 
an individual of one of the several classes forming the population N. Then if 
correlations for the population be determined, 7,,, where « and y are measures 
upon the same individual, may be called an organic correlation, while rz,2,, Ty, yo 
may be designated as direct and 7,,,, as a cross intra-class correlation, where 
the subscripts 1 and 2 indicate that the measures are taken on a “ first” and 
a “second” individual of the class respectively. 


A. Direct Intra-class Relationships. 


Let n be the number of individual measurements, e.g. of siblings or homo- 
types, in one of m classes. Let ¥ indicate a summation within any class and 
Sa summation of classes. For an individual class the rough moment coefficients 
around 0 as origin are ¥(a’)/n and ¥(«#)/n, while the possible combinations 
of individuals are n(n—1). Since the table is symmetrical m,=m,, o,= 0, and 
the mean product moment for any class is 


{eR = Se?) MiG 1) ee ae eee (v), 

or for the whole population of m classes 
S {P= Sey lve DV) ee ee eee (vi), 

or when n is constant from class to class 
SU @)P— >a mile Gy) ee eee eee (vil). 


To complete the calculation of + only the first and second moments for the 
population are necessary. ‘Two cases are possible: (a) the n’s are identical for all 
classes, and (b) n differs from class to class. In case (a) the moment coefficients 
for the whole weighted population of S [n(m—1)]=m[n(m —1)] individuals are 
unchanged by weighting, and are given by 


SIS@)/M, SISGONN see ee eee (viii). 


For case (b) the arithmetical routine is a little longer, the classes being weighted 
differently, but always in a(n—1)-fold manner. Thus the moment coefficients are 


S[(n-1)E(2’)|/S[n(w—-1)], S [(n—1) 2 (@)]/S [n(w—-1)]...... (ix). 
From this point the calculation of r is straightforward by (i). For clearness 
both case (a) and (6) will be illustrated. 


* In this case the data for the means and standard deviations are obtained by summing first and 
second powers of the numbers from the record sheets without classification. 
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Illustration (III—Aa) 1. Symmetry in the Ovary of Hibiscus. 

The morphologist interested in the degree of perfection of morphogenetic 
processes may desire to measure the degree of precision with which the different 
“ Anlagen” which go to make up a composite organ, say for example a quinque- 
locular fruit, are differentiated off from the common primordium and carried 
through embryogenesis. One convenient measure is furnished by the intra- 
ovarial coefficient of correlation. 


Take as an illustration a series of 1000 five-celled fruits of Hibiscus Syriacus 
from the Missouri Botanical Garden in the fall of 1905. Table I gives the sym- 
metrical intra-ovarial correlation surface for number of ovules per locule. 


Since all fruits have the same number of locules, the constants are not 
changed by weighting, hence, from Biometrika (Vol. vul., pp. 61—62; M.B.G., 
1905) we take 0=6°4648, o, ='892166. From Table I. 

S (0;'0.’)/N = 841990/20000 = 42:0995, 
r= {8 (0;'0,)N — &}/a2 = (42-0995 — (6-4648)}/(-892166) = 3843. 
TABLE I. Symmetry in the Ovary of Hibiscus. 


Ovules of Second Locule. 


Ss @ | Gilg | | 6 | 7 B. \otals 
> — —EE 
ele ee an Ee See a a 12 
a Oh Pal nee 1 ene 8 20 
p y 1/—]| 12] 28] 132] 56] 23] 259 
a 5 2/ 1] 28| 220) 899] 255| 551 1460 
a 6 6| 7] 1321] 899 | 6384] 1871 | 605 | 9904 
3 7 9/ 4| 56] 255 | 1871 | 2014 | 1134] 5336 
2 8 1) 8] 23) 55 | 605 | 1134] 1190] 3016 
a 

5 | Totals 1460 | 9904 | 5336 | 3016 | 20000 


But in Biometrika (loc. cit. Table VI) we have seriations of total ovules per 
fruit, = = (o0’) for locules. From this and the distribution for individual locules 
we have for the product moments, by (vii), 


S[¥(o)P = 1054938, S (0) = 212948, m[n(n—1)] = 20000, 
whence 
p= SL2 PSO) mn (w= D1—& _ 149-0995 — (6-4648))}/(-892166)? = 3843, 


orm 


avoiding all the labour of forming a correlation surface of 20000 entries, but giving 
no information at all as to the nature of the regression curve. 


Illustration (III—Aa) 2. Intra-ovarial Correlation for Seed Production. 
Next consider a physiological problem from the same data. Table II gives 
the intra-ovarial (or inter-locular) correlation surface for seeds per locule. 
$=3:9056, o,=1:756442, S(s,'s:’) = 339360, 
whence r = {S (s,'8,')/20000 — (3:°9056)?}/(1'756442)? = 5557. 
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TABLE II. 
Intra-Ovarial Correlation for Seeds per Locule. 


Seeds of Second Locule. 


| 0 1 2 3 | 4 | 5 | 6 | 7 | 8 | Totals | 
0 70,|- 123), 1268540) 3 12 4) — | -- 420 
1 | 128) 334| 414] 311 | 150) 72} -47| 8] 14 1460) 


126 414 714 642 447 204 102 | 23 4 2676 
54 311 | 642 | 1002 854 523 237 47 14 3684 
447 854 1036 868 522 | 102 26 4036 
12 72 | 204 523 868 1010 752 | 179 52 3672 
4 47 | 102 237 522 752 916 | 252 84 2916 
1| 23 47 102 179 252 | 150 79 840 


— 1 4 14 | 26 52, 84 79 36 296 


rt Ss %% 
w 

vet 

Et 

or 

~~ 

Oo 


pa ree 
DS 


md 


Seeds of First Locule. 


2916 | 840 


Totals | 420 1460 | 2676 | 3684 4036 | 3672 


But the same result can be obtained from the constants for seeds per locule and 
the seriation of total seeds per fruit (Biometrika, Vol. vi, pp. 60—62) in . 
precisely the same way as in the preceding illustration without the excessive 
labour of tabling 20000 combinations. 


Suppose, now, we had been given the physical constants only—not the 
distributions—for seeds (or ovules) per locule and total seeds (or ovules) per 
fruit. The correlation is still easily deduced, for the sum of the squares of 
total seeds, =S[S(s’)]? for locules, is given by substituting from Biometrika, 


Vol. vit, Table XII, = 

SE (s')P/N = % (s:2)/M = 02 + 82 = (70506)? + (5 x 3:9056)? = 43105374436, 
where the subscript ¢ indicates total seeds per fruit. Since V = 1000 for fruits, 

S[X (s')P = 48105374. 
In the same manner we get for the individual locules 
> (s”)/N = 02 + 5? = 18'338652, 
or since NV for locules = 5000, }(s”) = 91693:26. Hence, by (vii), 
r = {(431053°74 — 91693:26)/20000 — (3:9056)} (17564) 
= 1:71431264/3:084941 = °5557. 


The extent of agreement with other methods depends merely upon the number 
of decimal places retained in the constants. 


The importance of intra-class correlation coefficients in physiological work is 
emphasized by a comparison of this with the preceding, III (Aa) 1, result: 
For ovules per locule, 7 ="3843 + ‘0081. — 
For seeds per locule, 7 =°5557 + ‘0066. 
Excess for seeds, ='1714 + 0104. 
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The probable errors were calculated on the basis of the actual number of 
locules, n = 5000. 


Apparently differences (a) in the vigour of the ovules, or (b) in the chances 
of fertilization, or (c) in the supply of seed forming substances, from fruit to fruit 
tend to render the locules more alike in the number of seeds they produce than 
they are initially in number of ovules*. The discussion of these points must, of 
course, be reserved until all the data can be set forth. 


- Lllustration (III[—Aa) 3. LIntra-ovarial Correlation for Seed Weight. 


Case (Aa) may be further illustrated by a series of seed weights to be 
discussed in a forthcoming paper on Crinwm. Ten seeds from each of 200 fruits 
taken from a fine row of plants in the Missouri Botanical Garden in the spring 
of 1906 were weighed. From Illustration 2 it appeared that in Hzbiscus the 
locules of the same fruit have a tendency to produce similar numbers of seeds. 
Here the problem is to determine to what extent the seeds from the same 
fruit are alike in weight. The seeds range from 1—18 half-gram units+. This 
would give a symmetrical intra-fruit correlation table of 18?= 324 compartments 
and 200 x 10 x 9=18,000 entries. The work is very rapidly carried out by my 
formulae. The total weight in units for each of the 200 fruits is obtained by 
simple addition. These, shown in Table III, are of course the class = («’)’s around 


TABLE III. 


Total weight of seeds (in half-gram units) in Individual Fruits of Crinum. 


10 | 14 | 19 | 21 | 24 27 | 29 | 31 | 33 | 86 | 39 | 41 | 43 | 45 | 49 | 54 ; ‘ 
10 | 14 | 20 | 21 | 24 | 27 | 29 | 31 | 34 | 36 | 39 | 41 | 43 | 46 | 49 |) 55 | G2) 71 | 86 97 
10 | 15 | 20 | 21 | 24 | 27 | 29 | 32 | ; 36 | 3s ; } 55 y 
11 | 15 | 20 | 21 | 24 | 28 | 29 | 32 |] 34 | 36 | 40 | 42 | 44 | 46 | 50 | 57 Z 
12 | 16 | 20 | 22 | 24 | 28 | 30 | 32 | 34 | 37 | 40 | 42 | 44 | 47 | 50 | 58 | 63 | 75 | 89 | 107 
12 | 17 | 20 | 22 | 25 | 28 | 30 | 32 | 34 | 37 | 40 | 42 | 44 | 47 | 51 | 58 | 64 | 76] 90 | 115 
12 | 17 | 20 | 22 | 25 | 28 | 30 | 32 | 85 | 88 | 41 | 42 | 44 | 47 | 52 | 59 | 64 | 78 | 91 | 
13 | 17 | 21 | 23 | 25 | 28 | 30 | 32 | 35 | 38 | 41 | 42 | 45 | 47 | 52 | 59 | 64 | 80 93 
13 | 18 | 21 | 23 | 25 | 28 | 30 | 32 | 35 | 39 | 41 | 42 | 45 | 48 | 53 | 6O | 66) 81 | 94 | 125 
14 | 19 | 21 | 23 | 26 | 29 | 31 | 33 | 35} 39 | 41 | 43 | 45 | 48 | 53 | 61 | 69 | 84 95 | 

| | 
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0 as origin. The 2000 seeds weighed are distributed as follows: 


| | | 
6 7 8 Ge LON SN EN To LOM NLS, 


| 


219 396 | 352 290 | 212} 147| 106} 76 | 55 | 39 | 27 | 26 | 29 
| 


Os 
y 
HX 


* This is also the conclusion reached for Sanguinaria. See Harris, Biometrika, Vol. vit. 
pp. 327—328, 1910. 

+ Class 1=0:0—0-5 gr., class 2=0-6—1°0 gr., etc. For details see final paper, Annual Report 
Missouri Botanical Garden, Vol, xxi. 1912, 

Biometrika 1x 58 
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With Sheppard’s correction this gives* in units of weighing 
> (@')=87787, > (e2)=56350, 
> (a')/N =4:389, o? = 8'828346. 


From Table III S[> (2) P = 510496, 
(510496 — 56350)/18000 — (4389)? 
and we have r= 3:328346 ="6a9) 


agreeing exactly with the result from the conventional symmetrical table which 
will be published and discussed elsewhere. 


Illustration (III—Ab) 4. Fraternal Correlation when the Number of Com- 


binations is large. 


As material for an illustration of the calculation of fraternal (or rather 
sororal) correlation when the numbers are large, Dr Lutz kindly gave me the 
measurements of wing lengths in units of jrd mm. for the females in fifteen 
families of the Pomace Fly, Drosophila ampelophilat. Table IV shows the 
distributions of wing length, the class centres only being indicated. Using 
the method of a former note§, I formed the symmetrical intra-family corre- 
lation surface shown in Table V. From this I deduced for the weighted 
population : 

l= 66°200653, o,=7:107332, 
and for the first rough moments about 0 as origin of the individual arrays the 
numbers given in the last column of the table. Multiplying up by the grade 
of the array and summing simultaneously on the machinel|, I got 


Si) 338912 
whence r=(>(U/l)/N y/o? = 15883. 
But, entirely without. the labour of forming a heavy table, I find, from the 
first and second class (family) moments of Table IV, 
Sn(n — 1) = 28148, 
S[(n—1) = (UV)] = 1863416, 
S ((n —1) 5 (1%)] = 1238559412, 
S[S ()P = 126124937, 
S207) = 2733805. 
* Of course, where all the classes comprise the same number of individuals the weighting is 


uniform and the weighted population constants the same as are those calculated from the unweighted 
distributions. 


+ 


Table III may be conveniently verified by the agreement of its summation with this number, 
See Publications Carnegie Institution of Washington, 143, pp. 38—40, for some details, 
American Naturalist, Vol. xuv. pp. 566—571, 1911. 

American Naturalist, Vol, xurv, pp. 693—699, 1910, 
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TABLE IV. 


Wing Length in Families of Drosophila. 


| 
Family | 57 | 59 | 61 | 63 | 65 67 | 69 | 71 | 73 | Totals | (2) Si) 
eae ee eee ee 
31 a el a 9) 8 | 12 Ga eae 2053 136183 
32 : | Wee Gl Oe ol a= 12 | 830 57420 
Si | ie eae a Pee 5 | 15 9! 4] 4] 42 | 2832 191426 
39 ==) a ol ay | = oy ||) a || ae) al 50 | 3348 224634 
Gy | RE |e 6| 18 On | 37 | 2469 164877 
Dome ye Ge 1G | | 4) ea 3218 215928 
Oo epee ie 28 Pee Th 19. as) Al 54 41 3662 248654 
oN =e (=a) 35, 5 A | 3 =) || eae | eee 18 1166 75658 
oe = |= |) 6 | 12 16 7 Sail 52h 54 3538 232270 
GSC ee OL 1G, | 19 |. 91 6} 2] 53 3495 230749 
587 eee os eect |e ig | 1246 86274 
eCm el | A |, 90: | 1S Se | 1 BS 3525 234613 
aoe |e | Dalle lae elt ie, sr ieese | Sealy AG 2974 192646 
1193 2} 12 18 16 5 | — | — 53 | 38465 226749 
Ooc mee =| (4) || 18 || ~21 8 ja Peo eee ns) | 3348 215724 
621 | 41169 2733805 | 
\ 


Sororal Correlation for Wing Length in 


TABLE 
the Vinegar Fly. 


Wing of Second Sister. 


57 | 59 G1 C3 oy G9 Totals | = (2,’) 

= : : | 

x | | | | 

- 57 2 3 11 19 41 | BY) 21 9) 4 161 10605 | 
D 59 33 — A 15 29 41 24. 11 5 Iso 8977 
aS 61 11 7 74) 253 357 278 147 57 3 1187 77733 
ie 63 19 15 2S ee OUN | wi2a4ele HOSS 521 157 20 4084 268002 
fy 65 4] 29 BoM loa | 2OL2a D128 aa}: 322 BY tooo 483441 
Ons, o7 55 41 278 | 1085 | 2128 | 2558 | 1568 427 87 8227 | 945433 
2 69 AN hay t 1568 Dilte 440 67 5145 343643 
oe 5 158 | 31] 1608 107682 
cS To. é 31 12 266 17900 
Totals | 4084 28148 | 1863416 

Whence, writing the formula for 7 without reduction, 
S[z (U')P—SE ( Co ase Wz (t 
S[n(n—1 ~\ Sin(a—1 ) : 
bare ln (n 1) 7 _ 15888 


LEE 


S{m—1)=@)] _ 
S [n(n —1)] 


S{a-)2W)] 
( S[n(n—1)] 


aaa 


agreeing exactly with the value deduced from the table. 
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While the case is given as an illustration of method only, chosen primarily 
because of the small number of families, it is of rather unusual biological interest. 
These families were from inbred stock (sister by brother) derived originally 
from a single pair, thus representing as nearly as possible among unisexual 
organisms a “pure line.” Yet we find a sensible sororal correlation*. I hope 
Dr Lutz will find time to collect really adequate data on this interesting 
problem. 


I illustrate on this data one of the most convenient features of the methods 
here described. If, for any reason, one wishes to compare the values of corre- 
lations determined with and without the inclusion of certain classes, the various 
determinations can be quickly made by the simple addition or subtraction of the 
summations of weighted products. For example, suppose one wishes to test the 
effect of omitting families 32, 575 and 587. S[n(n—1)] will be reduced by 


IZ@xl14+2x18 x17=744; 
the first weighted population summation by 


11 x 830 +17 x 1166 +17 x 1246 = 50134; 
the second by 
11 x 57420 +17 x 75658 +17 x 86274 = 3384464, 


and the product summation by 
(830? — 57420) + (1166? — 75658) + (1246? — 86274) = 3381620. 


Deducting these from the comparable values given above for the population and 
calculating physical constants on the basis of the new weighted population J, 
we find r =:1419, 


Illustration (III—Ab) 5. Direct Homotypic Correlation. 


The records of the number of ovules per -pod in twelve individual trees of 
black locust, Robinia Pseud-acaciat, may illustrate the calculation of homo- 
typic correlations. The series is selected because of the wide variation in n. 
Naturally the smallness of the number of classes which renders the material 
convenient as an illustration of method, makes it of little value biologically. 
The seriations of numbers of ovules per pod and of numbers of seeds per pod 
appear in Tables VI and VII. The essentials for the calculation of r appear 
in Table VIII. 


* This may not of course be due to heredity. The environment of the unlaid eggs or differences in 
the experimental conditions during the larval period may be its source. But Dr Lutz considers sensible 
environmental heterogeneity out of the question. 

+ The correlation between the number of ovules and number of seeds developing per pod has been 
discussed for the combined pods of this series in Biometrika, Vol. v1. pp. 441—442, 1909. 
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TABLE 


Ovules per Pod in Robinia. 


7 | 8 | 9 \eto*| a | 12 

1 | 3/12! 24] 21) 28] 20 
oe) Soe oullae) 4 6| 10} 11 
deeiee Op esily 13.1351 Obi |. “96, 
ee ecnh 2 te) 31.) 25 |) 1a 
Sh 13) 25) OG St 
eS ia || 8a.) aT) 95 
= a Gi TO) Weales5 
a ees 2| «5-92 
ee a so: ait lb 4290 98 
Sees lh 81 3 5 | 92] 15 
eel as Sa Gi ie | 20 

z Qi) Ore Ay | 

11 | 30 | 103 | 199 | 245 258 | 


TABLE 


14 | 15 7 as 

ih 

6 1) 2 

1 = = 

2 < = 

5| | 1) — 

1| 2 

16] 9] 1/.3)— 

35 | 31) 25| 6) 4| 
14) 9] 2;/—] 1 

5 4 2 1 | — 

7) 3] 2)/—/ 2} — 
36 | 34 | 33 | 20/11] 6 
135 | 99 | 71 | 32 | 20 


Seeds per Pod in Robinia. 


Totals | 


122 


Tree 
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Totals 


122 
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Feeding the entries in Table VIII into the calculating machine and copying 
off the results I find, for population constants weighted in (2 — 1)-fold manner : 


S [n(n —1)] = 181152, 


S[(n —1) 2 (0')] = 2259669, 
S [(n — 1) & (0?)] = 29203787, 
S[S(s)P = 13114598, 
S [2 (o')P = 28866082, 
S[(v — 1) > (s’)] = 1468970, 
S [(n — 1) & (s?)] = 14864330, 


O 


2 
orm 


8 [2 (N= 
S[E()] = 


s 


9 
Os 


where o and s indicate ovules per pod and seeds per pod. 


= 12473884, 


5°613725, 
101750, 
219072, 
8109047, 


= 137530106, 
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These give: 
(S[= oP — S[E (0?) ]/S [n(n = 1)] - & = 459 
[S[=(s)P —8 [= @)U/S [n(n = DIF _ ggg 


Os 


goat 


S1So 


The number of individuals is too small to attach much importance to the 
result but the agreement with other series of Leguminosae is good*, although the 
correlation for seeds maturing is frequently considerably smaller than that for 
ovules formed. 


TABLE VIII. 
Summations for Individuals of Robinia. 
] i Pan snare. ,!!C!C!!C~«*d 
| Number Sum of | Sum of Ovules Sum of Sum of Seeds 
of n | n(n—1) Ovules, Squared, Seed, Squared, 
Tree = (0’) = (0” = (s’) z (s) 
| j 7 ee aif jj i 
il 122 | 14762 12738 13649 912 7394 
2 64 4032 796 10256 602 6522 
8 111 | 12210 1183 12839 595 3543 
4 102 | 10302 1064 11310 586 3938 
5 122 14762 1378 15896 864 6918 
6 120 14280 1299 14311 555 3091 
7 120 14280 1490 18850 713 4819 
s 159 25122 2251 32331 1495 15117 
9 128 16256 1580 19816 860 6894 
ee) 78 6006 | 946 11690 457 3211 
| merael 105 10920 1245 15135 984 9642 
| 12 196 38220 2875 42989 2347 30661 
|— S| —_—__—_ |- 
Totals 1427 | 181152 17380 219072 10970 101750 


Illustration (III—Ab) 6. Tests of the Existence of Sub-races. 

O. H. Latter in testing for differentiation in Cuckoo’s eggs according to their 
foster parents, determined the probable errors of the difference between the means 
of the several “gentes” and the means of Cuculus Canorus as a species. But one 
may also test for the differentiation of the foster parent “ gentes” from the species 
as a whole by determining the intra-class correlation for egg dimensions. I am 
especially willing to do this since Latter tested the significance of his divergences 


aos 


by the formula ‘67449 ee + N° whereas Pearson’s formula 
n 


Seas 2n\  n(M— my 
ee vs Foea ie v= 


would have been more suitable but had not then been published}. 


* Pearson, Phil. Trans. A, Vol. cxcvit. pp. 364—379 ; Harris, American Naturalist, Vol. xiv. p. 570, 
+ Biometrika, Vol. 1. pp. 164—176, 1901; Vol. 1v. pp. 363—-373, 1905. 
£ Ibid., Vol. vy. pp. 181—183, 1906. 


Table III of Latter’s second memoir furnishes the data. 
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‘5 mm. we have the results given in Tables IX and X*. 


Totals 


Length of Cuckoo's Eggs. 


TABLE IX. 


459 


Grouping in units of 


Hl | | | | | 
Foster Parent =] 38 | 39 | 40 42 | | 44. | 45 | 46 47 | 48 | 49 51| 53] Totals | 5 (0) > (0) 
| | | | 
l 
Robin o90 566 || == | 8 9 | 13 | 20 6) V1} 2 | 1 78 | 3559 | 162819 
Garden-Warbler... }]—|-—| 2 | 15 [LOM 25 r| 1O) ae = i 91 3988 | 174992 
Meadow-Pipit ...J—| 1) 1 4 | 27 | 19 8 a Del 2a —|— 86 3837: 171489 
Wien! ©... ae [——|—] 7 14 9 Gj By 2) — || 54 2315 | 99433 
Redstart ... oe f—|—] 1 3) 21 See del 6 5 1 | —|— 64 2869 | 128821 
Sedge- Warbler ...[—/-—| 2 | 4 VOR) GO Griese ay —|— 48 | 2120 | 983788 
Barred-Warbler ... }—,—-| 1 4 Wl}; 7] 37 —J] 1J— —|— 35 | 1529) 66899 
Whitethroat ...f 1 |—|— 7 SUN ee teal ete || ESS ay 1613 | 70457 
Marsh-Warbler ...}—|—| 1 | 3 NZ hal] 1 1 — 37 |) «1641 72867 
Red-backed Shrike | — | — | 2 | 20 | 2 90 | 67 | 60 | 23 9113 |—|—| 307 | 18716 | 613606 
White-Wagtail ... J—|—-|— | 10 29 | 33 | 23 6 4; 1] a 115 5145 | 230453 
Pied-Wagtail ...J—|—| 2 7 20 | 13°} 22 5 |/—|1|—|—] 85 3804 170532 
Reed-Warbler ...}—|-—| 2 a 13 7 | 13 9 at | 2 —|— 66 2959 | 132939 
Tree-Pipit ef} —|—|— 6 Ty) 6) 9 —|— 48 2174 98648 
Hedge-Sparrow ... | — | —|— 2 14} 13 | 18 Dae —|— 58 2629 119357 
Yellow-Ammer ...]—|—] 1 6 | |) We Bi 8 | ; 31 1377 61293 
| | | Soe EREASee | anda aoe aoa | eroneessaeeseeer 
| 1] 1 | 28 122 | 126 | 315 | 244 | 212| 81 | 63 | 12 | 7 | 1)|1 41240 55275 | 2468393 
TABLE X. 
Breadth of Cuchoo’s Eggs. 
Foster Parent | Zoo R SOMES NW Ge soe || oe \igo | 6 Totals | = (0’) =(b) |} 
Robin 600 ane 2 78 2621 88233 
Garden-Warbler ... 8 91 2975 97355 
Meadow-Pipit ... 4 86 2863 95425 
Wren me S06 lal 54 1750 56810 | 
Redstart ... 2 64 2111 69703 
Sedge-Warbler ... 8 48 1577 51881 
Barred-Warbler ... 5 35 1139 37119 
Whitethroat 5 37 1201 39041 
Marsh-Warbler ... 3 37 1210 39602 
Red-backed Shrike 0 307 10149 | 335833 
White-Wagtail 5 115 3851 129177 
Pied-Wagtail 5 85 2804 92590 
Reed- Warbler 4 66 2192 72898 
Tree-Pipit 2 48 1600 53398 
Hedge- Sparrow 1 58 | 1944 | 65220 
Yellow-Ammer 1 31 | 1021 33657 
Totals 1240 41008 | 1357942 


* Since in the method of computation adopted we deal with the summations of first and second 
powers of actual measurements, not with their deviations from arbitrary origins chosen to lighten 
computation, we can save materially in the size of the summed products by reducing to half mm, units 
instead of mm. units, i.e. 15:0=30, 15°5=81, etc. 
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Summarizing the results from these tables as under illustration (III—Ab) 5, 


we have: 
S [n(n — 1)] = 159484, 
Si@—1)=.@)) = 1201s; l = 44°644967, 
S [(n — 1) 5 (2)] = 818370242, of = 3:078842, 
S(S()P = 320370295, S[S (U2)] = 2468398, 
S[@—1) > @)] =5277568, b = 33:091520, 
S[(n — 1) 5 (b2)] = 174844798, oy = 1:266907, 
STS OYE = 176007866, S[S (62)] = 1357942. 


Whence by (vi, ix) as illustrated in greater detail under (III—Ab) 4—5 above 
I get: 
Thi, = ‘046 ap ‘019, r/E,. = 2°41, 
Tp,b, = 936 + 019, Fi] pS OF 


where the probable errors are calculated on the basis of the actual number of 
eggs, or N= 1240. 


Such a test as the above can never replace the R.L.D. (relative local difference) 
or the R.R.D. (relative racial difference) as the case may be, of Tocher and Pearson. 
But there are cases, as for instance the present and several unpublished series, 
where, now that a (relatively) rapid method of computation is available, it may be 
very useful. 


Illustration (III—Ab) 7. Inheritance of Milk Yield in the Male Inne. 


The test of the inheritance of a character like “milk yield” through the male 
must be made either by correlating between the milk yield of a cow and her 
parental granddam, or by determining the correlaticn between the daughters of 
individual bulls. 


The methods here described are particularly useful for cases of this kind, for 
while the number of daughters in a family may not be large, the range of variation 
in yield is apt to be very great thus rendering the tables cumbersome in size, 
whether or not heavy in number of entries. 


As an illustration I take a series of data from a paper by James Wilson, M.A., 
B.Se., “On inheritance of Milk Yield in Cattle*.” One of his tables (pp. 108—105), 
gives the milking record of the daughters of eleven Red Danish bulls. Our prob- 
lem is to determine the correlation in milk production between the daughters of 
the same sire, in order to ascertain whether there be an inheritance of milking 
quality in the male line. 


The data are of course very rough since they are apparently for the most part 
recorded to the nearest fifty gallon point. The yields range from 650 to 1400 
gallons. Omitting all records indicated as questionable, I sum the yields and 


* Wilson, J., Scientific Proc, Roy, Dublin Society, nu. s, Vol, x11. pp. 89—113, 1911. 
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the squares of yields directly from Professor Wilson’s table and enter the totals for 
each bull in Table XI. This gives 
ya lO O10: 


The probable error is based on the actual number of daughters, n = 76, not the 
weighted number, 
S [n(n — 1)] = 588. 
Whether or not this slight correlation in yield (y) is statistically significant, the 
case illustrates a very convenient method of dealing with such problems. I hope 
to discuss the biological sides of the question in more detail elsewhere. 


TABLE XI. 
Milk Yield in Daughters of Various Bulls. 
| 
: Numb f| Total Yield See Nese | a one | 
Sire iaaeuers Sy) (n-1) = (y’) | =(y?) (n-1) J (y?) | 
Maxi ‘ 11 8900 89000 7310000 73100000 | 
Stamfadern a 9 8000 64000 7195000 57560000 | 
Braendekilde Max 4 3800 11400 3625000 10875000 
Vigfus as oie 6 5900 29500 6090000 30450000 
Osvald Ejersminde 6 5500 27500 5120000 25600000 
Gunnar : nee 3 2650 5300 | 2362500 4725000 
Mazeppa IT 6 6400 32000 _ 6875000 34375000 
Tordensjold 5 4650 18600 | 4372500 17490000 
Trym | 7 6500 39000 6165000 36990000 
Taurus LV 13 11350 136200 10022500 120270000 
Ambrosius III 6 5300 26500 4765000 23825000 
76 | 68950 | 479000 63902500 435260000 
S [n(n — 1)]=5238, S [Total yield? = 491477500. 


B. Cross Intra-Class Relationships. 


If # and y be two measures on the same individual, and if correlation be 
determined between the individuals of the m sub-groups or classes constituting 
the general population, then as pointed out above, rz,7,, 7y,y, ave direct intra-class 
correlations while 7,,,, is a cross intra-class relationship. 

The number of combinations will in both direct and cross intra-class relation- 
ships be S [n(n —1)], since in the direct correlations one cannot correlate between 
a measure and itself and in the cross correlations one must not include the 
relationship between the a and y of the same individual—the organic correla- 
CLOG Fp9): 


The value of this direct organic correlation is given by the moment coefficients 


STE (2) Sm), STE@)/S@, STEY)VS, SE Y)/SM), --09 
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and by the mean product moment 
PSH Be C47 OI Se (0) bare RaneennnBe aR aeBNObAnaso sacs 50 (x1) 


taken about 0 as origin. Most simply they can be determined by the formation 
of an ordinary organic correlation table for the whole population which will then 
be described by N, S(a’), S(a), S(y'), S(y?) and S(a’y’), where S denotes a 
summation for the individuals of the population. This correlation will generally 
be wanted for its own sake. 

Again two cases are possible (a) all the m classes are equally large, (8) n is 
variable. The first two moments are given by the same formulae as in the direct 
intra-class correlation above, (viii)—(ix). The mean product moment around 0 is 


{S[3(@') 2 (y)]— SLz (e’y’ YS [0 (0 — 1), cceecceseeeeees (xil) 
or with a denominator of m [n(n —1)] where the m classes are equally large. 


Illustration (IITI—Ba) 8. Cross inter-locular (intra-ovarial) Correlation in 
Hibiscus. 


For a purpose quite foreign to my present one, the correlation between total 
ovules per fruit (== (0’) for locules) and total seeds per fruit (= (s’) for locules) 
has been tabled for the 1905 series of H. Syriacus. The Table has 17 x 36 entries, 
hence we reduce to the condensed form in Table XII. The organic correlation 
between the ovules and seeds of the same locule is shown in Table XIII. Taking 
the product moments of these two tables about 0, and subtracting that for locules 
from that for fruits to remove the products of the ovules and seeds of the same 
locule, and. to reduce V from S(n?) = 25000 to S[n(n—1)] = 20000, I find 

S [S (0) = (s’)] = 637491, So's’) = 128377, 
and, substituting physical constants from illustrations 1 and 2, I find 
S {[¥ Co’) = (s')]—S (o's )}/20000— 68 25-4557 — 64648 x 3:9056 


a Gas =~ -g99166 x 175644200a toes 


n 


os, 


TABLE XII. 


Correlation for Total Ovules and Total Seeds per Fruit 
in Hibiscus, 


Ovules f Total Seeds | Ovules if Total Seeds 
23 — as 32 81 1512 
24 3 41 33 80 1575 
25 2 26 3h 80 1436 
26 4 51 3d 85 1746 
27 18 285 36 54 1256 
28 45 708 37 54 1129 
29 107 1847 38 34 807 
30 188 3727 39 27 691 
bl 125 2322 AO 13 369 
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TABLE XIII. 
Correlation for Ovules and Seeds per Locule in Hibiscus. 


Seeds per Locule. 


Ghee ne esd Mieco ise ie alee) Totals 
2) | 
8 2 Qh Wie 3 
5 3 <P ee | 5 
Be 4 A | sy ok} a) aly 6 63 
z 5 G2 176 1109 |) 86. a4 | = elses 
6 49 | 169 | 315 | 490] 573 | 595 | 355 | — | — | 2476 
g 7 30 | 97 | 167 | 221 | 237 | 244 | 235 | 1083 | — | 1334 
a 8 14) 40] 81] 84| 107 | 108 | 139) 107 | 74 | 754 
© | "rotals | 105 | 365 | 669 | 921 | 1009 | 918 | 729 | 210 | 74 | 5000 


Comparing other constants for Hibiscus :-— 


Direct Intra-class for Ovules, r= °3843 + 0081, 
Direct Intra-class for Seeds, y= '5557 + 0066, 
Cross Intra-class for Ovules and Seeds, 7 = °1320 + °0093, 
Organic for Ovules and Seeds, p = °2722 +0088. 


In all cases the probable error is calculated on the basis of the actual, not the 
weighted number of locules. 


Illustration (III— Bb) 9. Cross Homotypic Correlation for Number of Ovules 
and Number of Seeds per Pod in Robinia. 


From Tables VI, VII, and VIII supra, we deduce S[% (0’) & (s’)] = 19070536 
which taken in conjunction with S (o’s’) = S[ (0’s’)] = 141476 taken from the 
organic correlation table for ovules and seeds in Biometrika (Vol. VI. pp. 441—442) 
and the physical constants 6, 8, 0), os, given under illustration (III—Ab) 5, gives 


os, = {({S[2O) &(s’)] — S[Z (0's) ]}/S [n (n — 1)] — 68} /oyos = 383. 
This may be compared with 7,.,=°452 and 71,.,='449. The discussion of the 


biological significance of the results from this and the preceding illustration must 
be reserved. 


Problem IV. To Determine Direct or Cross Inter-Class Correlations from the 
First Two Moments of the Individual Classes. 


Let # be the measure of a character in an individual of one class of p members 
and y be the measure of the same or a different character in an individual of 
another class of g members associated with the first for some logical (eg., 
biological, or sociological) reason. The coefficient calculated from a table in 
which every « of a first class is compared with every y in the second (associated) 
class may be designated as an inter-class coefficient of correlation. 


The number of entries in an inter-class correlation surface will be S (pq), where 
S indicates a summation of classes. > (a’)/p, &(w”)/p, U(y)/q =(y")/q are the 


59—2 


464 Tntra-Class and Inter-Class Correlations 


first two moment coefficients of individual classes, [> (a’) = (y’)]/pq the product 
moment coefficient for an associated pair of classes and 


SIE@QIO)ISGO) eee (xiii) 


the rough product moment coefficient for the whole population. 


For every first class of p individuals there is an associated second class of q 
individuals, Hence, the individuals of the first class are always weighted* in a 
q-fold manner and those of the second class in a p-fold manner. The population 
constants weighted for the calculation of 7 are given by the rough moment co- 
efficients about 0 as origin, 


Sle) Sq), Sle @)I/'S (oq), .- ae eee (xiv) 
S[p=y)/S (pq), “Sloe yD] Sq). serene (xv) 


Since a and y may be measures of the same or different characters it is clear 
that the formula is the same for direct or cross inter-class correlation. 


Illustration (LV) 10. Racial Heterogeneity and Homogamy in Man. 


It has sometimes been suggested that homogamy for physical and mental 
characters in man is due to (a) the existence in the population of numerous local 
races, (b) the excessive frequency of intra-racial marriages, attributable to pro- 
pinquity merely, and (c) the pooling of records from many local races in the same 
correlation surface. 


A study of the literature of assortative mating convinces one that so simple an 
explanation is inadequate. Nevertheless, it is very important to give full weight 
to any factors tending to bring about a spurious correlation for assortative mating. 


One way of testing the question is to make random matings within the local 
races, pool the results for a series of districts, and determine the correlationt. The 
influence of the local differentiation can be read off in the inter-sexual correlation 
coefficient thus secured?. 


A convenient series of materials for stature is that gathered by Tocher§ for 
the counties of Scotland. To be sure, his measurements are on individuals who 
have become insane, but data for the normal population are lacking. 


We may work from the table of constants (Biometrika, Vol. v. Supplement, 
Table XIX), taking populations of arbitrary size, say 100 males, m, and 100 


* When it is possible to collect data so that either p or q, or both, are constant throughout the 
m pairs or classes the work is obviously considerably simplified. Unfortunately such cases are rare in 
practice. 

+ This was actually done for parish churchyards in the cooperative study of assortative mating for 
duration of life. Cf. Biometrika, Vol. 11. p. 485, 1903. 

+ Of course our problem has some similarity to but differs fundamentally from that considered 
by Tschepourkowsky in his study of the resemblance between man and woman. Cf. Biometrika, Vol. tv. 
pp. 161—168, 1905. 

§ Tocher, J. F., ‘‘ The Anthropometric Characteristics of the Inmates of Asylums in Scotland,” 
Biometrika, Vol. v. pp. 298—350, 1907. 
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females, f, from each district. Then for a district, }(s’)= 100 s, the population 
means are given by S[%(s’)]/N, the population standard deviations* by 

s.D2 = S[100 (8 + o,)]/N —[S (100 8)/N}, 
where s represents stature in males or females as the case may be, the bars and a’s 
indicate county means and standard deviations and N= S(n)= mn = 22 x 100. 


The population constants are : 


For Males : 

S (100s) = 145080, N= 2200; 

S (100 8) = 9567888, s =65°945, 

S (100 o,?) = 17335-08, s.D2= 8116546, 
For Females : 

S(100s) = 134760, N = 2200, 

S (100 8’) = 8255232, Ss = 61:255, 

S (100 o,?) = 13992:05, Sp2—= 6618921. 


These will suffice for getting the correlations, since all classes comprise the 
same number of individuals and no change in the s or oa, is introduced by 
weighting. 

The totals of the correlation surface are 22 x 100 x 99 = 217800 for the intra- 
patrial correlation for stature of random pairs of males and random pairs of females, 
and the rough product moment coefficients are given by 


{100 S(100 s?) — [8 (100s”) + S(100 o2)]}/S [n(n — 1)], 
whence, for males, 27 =:019, 
for females, + = °029. 


But for the inter-sexual correlation between random pairs of men and women for 
the several counties, pooled together, there are mn’, not m [n (2 —1)] combinations, 
since each male may be compared with all the females of the district, while in 
intra-sexual correlations the individual’s own measure must be deducted. For 
random pairs of men and women we have, therefore, for the unadjusted product 
moment 
S[% (8‘m) = (s’s)] = 888716500, N = 220000, 
whence 
r= (4039°620454 — 65:945455 x 61:254545)/2°848955 x 2°572726 = 022, 

The result just deduced taken in comparison with that secured in a similar 
manner for the several parishes of the Wensleydale districtt, seems to indicate 
that the geographical differentiation in the population plays a very small part in 
the production of the correlation between husband and wife. 


* In getting S[100(s?+.o,2)] for the standard deviations we can save the labour of mental additions 
of large squares by taking separately S (100s?) and S(1000,2). We have then only to multiply 
S (100s?) by 100 to get S[100s]?=S([z (s’)}2. S=(s’) is of course given by the second moment calculated 


for the population s.p. Thus the squaring and summation can be quickly done simultaneously on any 
of the usual types of calculators. 


+ Biometrika, Vol. u1. p. 485. 
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Problem V. To Determine Fractional Direct or Cross Inter-class Correlations 
from the First Two Moments of the Individual Classes. 


Under (IV) every w of the first class is compared with every y of the second 
class. In actual practice, however, cases are frequent where all the possible 
comparisons are not made. Such relationships may be designated as fractional 
inter-class correlations. They are more difficult to determine than the simple 
inter-class constants but they are of very great importance in many cases. Of 
course the method is quite general, but for clearness I take the determination of 
the avuncular correlations as an illustration. 


If « be the measure of a character in a parental array, family or sibship of p 
individuals and y the measure of the same or a different character in the offspring 
array (or family or families) of g individuals, then the inter-class correlation in- 
volving the comparison of every « with every y in the related classes of the 
population of m classes may be designated as an ascendant (or, if one chooses, 
descendant) correlation of the first order*. From such a table, actually formed 
by the method described under (I) or only expressed algebraically as under 
(IV), one may easily derive the avuncular correlation—ze., that between “ uncles 
(aunts) ” and “nephews (nieces) ’—as follows : 


In such an ascendant correlation, the offspring individuals are associated with 
all the members of the parental array, both their own parents and their parent’s 
siblings, or there are S (pq) combinations. 

Hence it is given by formulae (xiii—xv) under (IV). To obtain the avuncular 
correlation we have only to deduct from the five fundamental summations of the 
ascendant correlation those for the relationships between (weighted) individual 
parents and their S(q) individual offspring, and proceed to the correlation of r on 
the basis of S (pq — q) instead of S (pq) individuals. 

But the direct parental correlation is practically always wanted on its own 
account. It is most easily obtained from an ordinary or condensed correlation 
table, or by the method described under (II). Designating by the subscript 7 the 
measures entering into the correlations for individual parents (weighted with their 
offspring) and their offspring, we have for the avuncular relationship (the fractional 
inter-class correlation) by simple subtraction, 

Sz (#') = (y)] — S(ai’y’) 
S (pq) — SQ) 
for the product moment coefficient, and 


S[q=(@’)]-—S(w/)  — S[g® (#”)] — S(#i*) 


S (pq) —S(q) ’ S (pq) = S@ ) onoodopasmad (xvil) 
S[p=y-SY%) Slp=Y7AI-SY) Guat 
S(pq)-S(q) ’ S(pq)-S(q 


for the moment coefficients. 


“ If correlation between parental and offspring arrays be designated as of the first order, that 
between grandparental and grandoffspring arrays is of the second order, and so on. 


| | | | 

-|-| of |osor| ose | ze}-|-|e | etjetie | - |] -)]-|-l-/- 
SS aGG ei Obie wey CealeG |= |) = eon cle) FG fa il ee mel Ta) ae 
-|-| 8g | 6228 | THe} 9s|/-)/-|T |r |9 | 8 |e |e |—j-j-]- 
|= lees MOIGE | Coe Ie Gi a= | ae RCT 120) es oly Oe ali 
Sl) Oe COGN OSC Neti le oe) GS Coal Cia) le ee Veet 
Pane eg eaelr c6er | Ay p= | De | re | SU |) Le le 
I—|=| 4g VCrip | Livy | l/—|—)]36 19 | Sek | Tt | = | alll 
| ee CGGS | GIT. prs) | te gen pee | lee Gee flee 
—|-| g¢ | seze| 98¢ | OF | —|-|% |8 | FTI18 | 42 | — IT [-y-]a 
Tle fe Josey! OLY | OF |— le) | 8h 6h) € fl | — palais 
el Og Miia | Gee | 1h |= = | ee | AG Ie a el 
—|—| 6g [| ogee | 898} 1h} -—|—-| — |e | Tt] FT) 4 | T = 
il-|-| 9g | 988c | OPE | Ih} -—|—|] -— | & |S | PL/| OL] 6 |r 
-|-| 2 [tee | tee | ]-|-|- | - | % | et) | + I-|-|- 
= te OG) NAC RGe: (eCSo Oct pera Mlalling Nee a1 eC aS pe mey  a Taal 
Sh 39e Pele, 6088 (ul yells OG 1 GRRL | Se 0) = |e \en 
|-|-| ve | eoze | O96 | se} —|-| — |S | et) at; 9 Ft | p-jmla 
Sle ee peo S06 | Te t= eta = | OTe ces b= 
=| lave SROrLe | Sren6e) == k= ere eee OT seals S| aS 
i ahey WEG LES HGS | = SS OO Ge Ge KE leah 
SS BOs. | cares 1eGe Goa =| ala Oma arantcreay ale |e Waal cael 
SII GE OW WET, (peCle ee. |=) i ee Sel nie ecu) Gaol 
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These formulae lend themselves readily to practical manipulation. The rough 
moments for the parental-offspring surface are generally available; the others 
are obtained by rapid machine calculation. 


Illustration (V) 11. Avuncular Correlation in Garden Beans. 


? 


correlation in a 
series of forty families of garden beans—The Navy H series of former papers *. 
Table XIV gives the necessary data. The first part shows the weight of the seeds 
from the 40 parents, seriated individually with the sums and sums of squares of 
their values. Only a portion of the seeds (roughly 10) from each of these mothers 
were planted, and give the offspring arrays shown with their summations in the 
second part of the table. 


As an illustration I take the calculation of the “avuncular’ 


The direct parental correlationt—that between the weight of the individual seed 
planted and the weight of the seeds which it produces—is given in Table XV. 
TABLE XV. 
Seed Weight in Garden Beans. 
Offspring Weight. 


9 | 10 Totals | 

3 | 1 Abs ely AL 
So a C= eno 6 8 4 4 4 
"D 4) =) Teil Sais 21 ge ay ee | |) 
= —| 2/10; 21] 62] 103] 100] 55;19| 4) 3) —| — 
em eh al FN 29) A TS aa) TSA ON OS |e S| eee ee ee 
§ 1 Bee BORE |e LOQN eye lle benno mami ee |e 
S Lo =) Ge VS Sl | SI ACG Mee T2212 oer ee 
mH aha =| 2 7 4 SORE 23a) Onto meee 
aw = ae By ihe leone eae — Il 

| Totals 4126 | 416 | 251 | 


* American Naturalist, Vol. xuvi. pp. 813—343, 512—525, 656—674, 1912; Roux’s Archiv f. Ent- 
wicklungsmechanik der Organismen, Ba. xxxv. S. 500—522, 1912; Biometrika, Vol. 1x. pp. 11—21. 

+ The Editor has called my attention to the possibility of some misunderstandings which might be 
introduced by this illustration and I am glad to add a word of explanation concerning it. 

The chief question which is likely to be raised is: ‘* What constitutes direct parental inheritance in 
a character like seed weight?’’? Some rather complicated biological questions are involved. The coats 
of the seed are a part of the plant individual which produces them—are in fact strict homotypes in 
Pearson’s sense of the term. But the embryo—the cotyledons, hypocotyl and the plumule, the elements 
which constitute the major part of the weight of the seed—is strictly speaking a daughter plant. Hence 
an intra-individual correlation for seed weight is not a true homotypic correlation but a fraternal 
or sororal correlation. Thus from the embryological or genetic standpoint the correlation between the 
weight of the individual seed planted and the weights of the seed borne by the plant into which it 
develops is the true parental correlation. 

There are, however, two important modifying conditions. First, the correlation is between two 
individuals in an embryonic stage, not in the adult condition. It is somewhat analogous to the 
correlation between the larval characteristics of two successive generations of moths, and resembles 
still more closely the coefficient measuring the resemblance between a parent’s weight at birth and 
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Designating the parental generation seeds by w and the offspring seeds by y, I find 
from Table XV, 
> (v7) = 15416, > (y’) = 14097, 


= (#?) = 145400, (y) = 123278, 
> (a’y') = 130794, 
whence, with Sheppard’s correction, 
r = (130794/1668 — 9:242206 x 8°451439)/1:291727 x 1547424 = 1519. 
From Table XIV I find for the ascendant-descendant correlation 
S (pq) = 60756, 
S [q& (#’)] = 564066, — S[ p= (y')] = 514990, 
S[g> (#?)] = 5351080, S[p>=(y")]= 4518074, 
S [2 (#’) = (y’)] = 4798171. 
These give 
r =(4798171/60756 — 9:284120 x 8:476364)/1:340412 x 1559539 = 1334. 
But obtaining the summations about 0 for the avuncular relationship by simple 
subtraction I get for the population V 
S[q (p — 1)]=8 (pq) — 8 (q¢) = 60756 — 1668 = 59088, 

and for physical constants, for “aunts” 


© = (564066 — 15416)/59088 = 9285303, 


ox = V(5351080 — 145400)/59088 — (9°285303)? — -083333 = 1341743, 


> 


and for ‘nieces’ 
Wf = (514990 — 14097)/59088 = 8:477068, 


oy = V(4518074 — 123273)/59088 — (8477068)? — 083333 = 1559872, 
and for the relationship between them 
r =[(4798171 — 130794)/59088 — Zy]/oxoy = +1329. 


a child’s weight at birth. Second, all the seeds produced by the same plant are under closely similar 
environmental conditions. Thus while they are (so far as the embryo to which weight is chiefly due) 
sister seeds, they are far more nearly alike than would he expected on the basis of heredity alone, for all 
are borne in and nourished by the maternal tissues. 

Under these circumstances one may consider either (a) the correlation between the individual seed 
planted and the individual seed produced, or (b) the correlation between the average character of all (or 
of a representative sample) of the seeds of a parental and filial individual to be the true correlation. 

If the latter view is taken one considers the characteristics of the seeds to be determined exclusively 
by the plant on which they are borne (the parent plant). 

If the former alternative be adopted one neglects for the moment the important environmental factor 
introduced by the fact that the whole array of parental seeds are subjected to the limited environment 
of one plant while the offspring array (produced by any single plant) is subjected to the environment of 
another plant. 

It has seemed to me safest to follow strictly the biological homologies in terminology. 
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I note that the parental relationship is the highest and the avuncular the 
lowest. These results cannot be discussed in detail until the final memoir from 
which they are abstracted is published, but it seems to me that they agree 
with Hanel’s data for Hydra as analyzed by Pearson* in attording no support to 
Johannsen’s conclusions concerning the ineffectiveness of selection within the pure 
line. 

If one desires a check, or if linearity of regression must be tested, one may pro- 
ceed by the method already described for the formation of condensed correlation 
tables +. 


Hither ascendant or descendant seeds may be taken as the “first” or subject 
character. Since they are quickly formed, both have been drawn up for the con- 
venience of the reader and appear as Tables XVI—XVII. In them, the first two 
columns give the reduced tables for parents and offspring (or offspring and parents), 
the second two give the frequencies for the ascendants weighted with their 
descendants (or the descendants weighted with their ascendants) and the totals of 
the associated arrays. The final two columns are obtained by subtracting the first 
pair from the second, and give the avuncular relationship desired. Naturally, the 
final summations of these columns may be subtracted if linearity tests only are 
not desired. All the above constants may be checked directly from these tables. 


TABLE XVI. 


PARENTAL ASCENDANT-DESCENDANT AyuNCULAR 
CoRRELATION CorrELATION CorRELATION 
Weight of a 
Ancestral | 

Seed Frequency of | Total | Weighted | Total Weight Weighted Total Weight 
Actual Parents Offs me eal Frequency of | of Weighted Frequency of _ of Weighted 

Weighted with Weicht | Descendant | Descendant | Nepotic Nepotic 

their Offspring | ae | Seeds Seeds Seeds Seeds 

¥ — — 35 306 35 306 

4 — = | 40 340 | 40 340 

6) 8 70 | 294 2526 286 2456 

6 28 210 | 1147 9313 | 1119 9103 

7 79 648 | 3389 27780 | 3310 | 27132 

8 379 | 3188 | 11510 96824 | 11131 93636 

9 489 4085 | 18095 151467 17606 | 147382 

10 375 | 3111 15184 127618 14809 |} 124507 

LT 258 } 2270 | 8247 71591 | 7989 69321 

12 42 396 2406 22963 | 2364 22567 

13 10 119 | 364 3775 354 | 3656 

14 — —— 45 487 | 45 | 487 

Totals 1668 14097 60756 514990 59088 500893 


* Biometrika, Vol. vir. pp. 372—381, 1910. 
+ American Naturalist, Vol. xuvt. pp. 477—486, 1912. 
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TABLE XVII. 


| PaRentTan ASCENDAN?T-DESCENDANT AVUNCULAR 
CoRRELATION CorreLATION CoRRELATION 
laWieielit of |e = Aree eee 
Offspring | . | . 
Seed Total Weight} Weighted Total Weight} Weighted Total Weight 
Actual — of Weighted | Frequency of | of Weighted | Frequency of | of Weighted 
Frequency _— Parental Ascendant | Ascendant Avuncular Avuncular 
Seeds Seeds | Seeds Seeds Seeds 
3 3 CO 30 127 | 1213 124 1183 
4 | 67 263 2514 256 2447 
5 41 | 370 1473 | 13393 1432 13023 
0 111 1012 3901 35562 3790 34550 
7 270 2437 9672 88297 9402 85860 
8 426 3892 15421 142308 14995 138416 
9 416 3841 15190 139924 14774 136083 
10 251 2366 9333 88150 9082 85784 
11 98 939 3642 | 34725 | 3544 | 33786 
2 30 293 1098 10830 1068 | = 10537 
3 12 134 511 STAY 499 5623 
14 2 | 22 82 891 80 869 
15 i 13 43 502 42 489 
mae. | —a ae 
Totals 1668 | 15416 60756 564066 59088 548650 


TI. RECAPITULATION. 


The term intra-class is suggested for the correlation describing the degree of 
resemblance of the individuals of the series of sub-groups or sub-classes into which 
a general population may be divided for some logical (e.g., biological, sociological) 
reason. ‘The term inter-class is applied to those correlations which describe the 
degree of interdependence of variables belonging to the logically related sub-classes 
of different populations. Such coefficients have a very wide but hitherto little 
realized range of usefulness. 


Generally, their determination requires the comparison of each individual with 
the array of individuals constituting its own or its associated class. Heretofore 
the use of such coefficients has been limited, indeed almost precluded, by the 
excessive labour of forming the weighted tables. Methods for the rapid formation 
of the tables by the aid of mechanical calculators have been described, and 
formulae are now given by which such correlations can be determined without 


the formation of tables at all. 


The labour of calculation may thus be reduced to a small fraction of that 
formerly required. ‘The formulae present numerous opportunities for checking 
the arithmetic, but of course certain errors may be more easily detected when 
tables are used. Furthermore the tables, full or condensed, should be formed in 

60—2 


472 Intra-Class and Inter-Class Correlations 


a certain number of cases for the purpose of testing the form of the regression 
curve. 


Finally, in emphasizing the use of Intra-class and Inter-class Coefficients it 
may not be out of place to warn the reader that, as in the case of many other 
statistical relationships, the raw or uncorrected values may give quite erroneous 
impressions of the biological relationships under investigation. Some such cases 
will be discussed later. In the meantime, the purpose of this paper will have 
been fulfilled if the range of illustrations proves suggestive of wider application 
and if the formulae prove as useful to other biologists as they have to workers 


in my laboratory, where they have been very extensively used and have become 
indispensable tools. 


In conclusion I take great pleasure in thanking most heartily the Editor of 
this Journal for the kindly criticisms and suggestions which have materially 
improved the manuscript. 


A SECOND STUDY OF THE WEIGHT, VARIABILITY 
AND CORRELATION OF THE HUMAN VISCERA. 


By M. GREENWOOD, Juyr., ano J. W. BROWN. 
(From the Statistical Laboratory of the Lister Institute.) 


In 1904, one of us published an account of certain applications of biometric 
methods to data obtained from the pathological department of the London 
Hospital*. The object of that essay was to throw light upon the mean values, 
variabilities and correlations of the human visceral organs, in the hope of (a) Placing 
on record results comparable with the numerous studies of anthropologists which 
deal with other organs or dimensions: (6) Providing materials for the study of the 
changes produced by various diseases in the absolute measures and inter-relations 
of the organs observed and, therefore, pro tanto, illustrating the work of selection 
upon the human species. 


In the paper cited, attention was directed to the specialised characters of a 
“General Hospital Population” and it was pointed out that results obtained from 
such a population could not be generalised without inquiry as applicable to the 
race at large. It was also shown that groups selected upon various principles 
within the “General Hospital Population” exhibited marked deviations from the 
prevalent type, a selection of the more healthy organs being attended by a rise in 
correlation and a fall in variation, while a group characterised by a preponderance 
of specifically diseased organs exhibited changes in the opposite sense. 


The difficulty which had to be faced in that investigation was the task of 
ascertaining the characteristics of a healthy “General Hospital Population.” By 
this expression we define the class of persons of the physical, racial and economic 
type to which hospital patients conform but who are not suffering from any disease 
likely to cause death or seriously affect health. The nature of the material avail- 
able in 1904 did not permit of such a selection being made and all that could be 
done was to choose from the data those organs which, so far as the records showed, 
were not obviously diseased. Such cases formed the normal or “healthy ” material 


* Biometrika, Vol. ut. p. 65. 
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of the earlier paper. In Table I we have grouped the results obtained from such 
data; the figures relating to males have already been published, those dealing with 
females are based upon the manuscript records, and are given for the sake of com- 
parison. It will be noticed that in two cases the correlations for the female data 


TABLE I. 
A. Constants calculated from Old Data. Healthy Organs. 


Organ Sex Mean | Standard Deviation Correlation 
| 

Heart... M. 11°04 + ‘07 ozs. 1°923 ozs. *2780 + ‘0329 
Liver ae M. 6044+ °32 4, | 8948 ,, [H. and L.] 
Heart .... F. 1012 -o,, 3:087 ,, | 2163 + 0325 
Liver eo ae 55°23+°15_ ,, 4:298 ,, [H. and L.] 
Heart ... M. 11°25+°:06 ,, 2 OBd= 55 *2654 + ‘0276 
Spleen... M. 529-060, 1:996 ,, [H. aud §.] 
Heart... F. OOD cs O ee 3°422.,, "1881 + :0295 
Spleen ... | F. 6-07+°10 ,, | 2107 os [| H. and 8.1] 
Heart... | M. ieise ls} | 1946 ,, “4004 + *0279 
Kidneys... | M. 12; Ol sen Q ae, 2 O16 5, [H. and K.} 
Heart... F. 9:'45+°09 ,, | DiAAD ee *4465 -++ 0292 
Kidneys ... F. 10°58=2-10%,, 22620", [H. and K.] 


B. Relative Vuriability of Healthy Organs. 


Oreans Sex | Coefficient of Variation 
Hearts with Livers Bers M. | 17°42 | 
Hearts with Spleens... 3 18°42 + mean 17°71 
Hearts with Kidneys ... | 4 | 17°30 
Livers... Bee AN 3 | 14°80 
Spleens ... Pe nae bs 38°21 
Kidneys ... Ate ie Fe 16°80 
| Hearts with Livers Seale ake 30°49 \ 
| Hearts with Spleens ... | 3 34°17 - mean 30:17 
Hearts with Kidneys ... | . 25°86 
Livers... Sas An 7 7°78 
Spleens ... Hor aE elo os 52°69 
Kidneys .=, © “.. oe = 24°76 


are slightly smaller, in one instance perhaps significantly smaller, than the male 
values, while the relative variabilities are in two cases higher among the females. 


It was to be anticipated, from the nature of the case, that the results just given 
would be only rough approximations to the desired values descriptive of a healthy 
“General Hospital Population.” This anticipation has been confirmed by other 
work. Thus Pearson has shown that the form of frequency distribution indicated 
by the graph of “healthy” hearts would suggest arbitrary limitation at each end 
of the distribution, some small and many large hearts having been unjustifiably 
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classed as abnormal*. It therefore seemed desirable to re-investigate this par- 
ticular problem and a special motive for doing so now is that one of us hopes 
shortly to collaborate with Dr Mott in the reduction of similar but more extensive 
data collected in Claybury Asylum. To make this latter inquiry complete, some 
comparison of biometric constants derived from an insane population with those 
yielded by sane stock is essential. The sane population as a whole not being 
available, and the economic strata which provide general hospital and county 
asylum patients being similar, it seemed probable that an extension of the earlier 
inquiry would be of value. Under these circumstances recourse was once more 
had to the pathological department of the London Hospital. We took this step 
with the more confidence because we knew that since the date of the first investi- 
gation the pathological records of the London Hospital had been enormously 
improved, under the supervision of Dr H. M. Turnbull, Director of the Pathological 
Institute. 


Dr Turnbull kindly supplied us with data prepared in accordance with his 
instructions and we desire to express our gratitude to him not only for placing the 
material at our disposal but for much friendly advice and criticism throughout the 
inquiry. 

The data provided by Dr Turnbull include, with the exception of the weight of 
the spleen, all the measurements recorded in the former series and in addition the 
weight of the brain (in most cases), and the weight of the whole body and its 
length. We must, however, remark that the last mentioned datum, although 
included in one of our tables, has been shown to be unreliable, so that no sound 
conclusions can be based upon it. We shall refer to this point later. 


With respect to the methods by which these data were prepared, we have 
somewhat exact details. Dr Turnbull permits us to quote the following passage 
from a private letter addressed to one of us. 


“ Before the bodies are put on the P.M. tables they are weighed and measured 
and the results are entered by the porters upon the tops of the sheets for the 
description of the necropsy. Each organ after it has been removed, cut and de- 
scribed is placed in the tray at the end of the table and left there until Dr Bartlett 
or I have gone through the case dictating the diagnosis of the case as a whole and 
of each organ. So soon as most of the organs have been completely separated the 
porters lift them from the trays, weigh them and write down the weights on the 
large slate in the P.M. room. Occasionally an assistant places an organ in the tray 
before slicing into it; it may therefore be weighed before it is sliced, but this is 
exceptional. The organs are almost always completely separated and therefore 
weighed before Dr Bartlett and I go through the case. In special circumstances 
they are placed in the trays without separation, e.g., in disease of the bile ducts 
the liver, pancreas, stomach and duodenum would be placed together in the tray 
without separation until Dr Bartlett or I examined them. Such organs would be 


* Biometrika, Vol. tv. p. 183, 
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weighed later than the others. In cases of accident it would be extremely 
unlikely that such a proceeding would ever be necessary. You may take it 
then that in the cases sent to you the organs were weighed almost immediately 
after removal and examination by the assistant.” 


This material although naturally less extensive than that previously used is 
far more reliable. Dr Turnbull knew that we wished to study healthy organs and 
consequently the data received by us formed a selection not a random sample of 
the “General Hospital Population.” We have, however, carried the process of 
selection yet further. We chose from the data all the males at ages 25—55 and 
rejected from this number (1) all that had died of a wasting disease, such as 
Cancer or Tuberculosis; (2) all cases in which a possibly pathological condition of 
the heart was noted, whatever the weight of the organ might be; (3) cases in 
which Syphilis might be suspected (those with record of aneurism in particular) ; 
(4) all cases with no recorded body-weight. 


The cases without recorded body-weight have not been considered at all, but 
those rejected from the series on account of (1), (2) or (3) were subsequently added 
to the remainder and the constants of the whole series ultimately calculated. Our 
work therefore falls into two parts, the first being an analysis of some 78 doubly 
selected cases, the second a reduction of about 100 records selected also but not so 
stringently. An examination of the data which are published as an appendix to 
this memoir will enable the reader to appreciate our method; it may be remarked 
that certainly one and possibly two cases not admitted to the “double selection” 
should have figured there (one was left out by mistake), while some of those 
admitted, in particular the epileptics, ought possibly to have been omitted. On 
the whole, however, we think the “double selection” conforms pretty well to 
the type we had in mind. Needless to say, it falls far short of the ideal, since, 
not to mention other points, we have not been able to take account of occupation 
or race in view of the paucity of our material. It may also be mentioned that we 
have other data relating to persons outside the limits of age here adopted; the 
analysis of these cases will be dealt with in another communication. 


The non-medical reader will perhaps wonder how it is that we think it worth 
while to deal with so few cases, and may object that the London Hospital is only 
one of a number of great medical charities and that the material from all of these 
should be pooled and treated as a whole. This would be so obviously the proper 
course were it practicable that we may be allowed to point out the difficulties in 
the way of its adoption. The effective absence of coordination between the medical 
schools, the diverse systems in force and the contempt of statistical data which up 
to recent times characterised all but a minority of the hospital staffs, would render 
any individual attempt to deal with combined records a troublesome and possibly 
futile undertaking. We are not speaking without knowledge of the statistical 
systems in vogue at a large hospital when we say that the present state of affairs 
is unsatisfactory. Every year tons of paper and gallons of ink are devoted to 
recording the experiences, medical, surgical and pathological, of the great teaching 
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institutions, An extremely small percentage of the results have any value at all, 
while even that residuum is not readily accessible. Comparatively little trouble 
and some expense would notably diminish the output of waste-paper and increase 
the production of valuable records. 


Hospital administrators pay little attention to the advice of private enthusiasts 
—and quite properly since compliance with requests for innovation might turn 
hospitals into happy hunting grounds for faddists of all kinds. There are we think 
but two ways in which genuine statistical reforms are likely to be brought about. 
One is through Government or Municipal action, a process which does not (if we 
may judge from the report of the recent Royal Commission on the Record Office) 
necessarily lead to the best results. The second is through the realisation on 
the part of leading physicians and surgeons of the importance which attaches 
to organised statistical data. Perhaps when consumption has been “ conquered ” 
and the infant death rate reduced 80°/,, medical protagonists may be induced to 
turn their attention to a humbler but still important reform. The question is 
really of more importance than might be inferred from the subject of this paper. 
The problem with which we are immediately concerned here is not of momentous 
interest, but the difficulties under which we labour apply to all statistical investi- 
gations of a “General Hospital Population.’ Since every youth destined to earn 
his bread as a doctor acquires his first and therefore his most vivid impressions 
of the phenomena of disease among a “General Hospital Population,” it is of 
importance to all his future patients that these impressions should be controlled, 
checked and, if possible, corrected by any available method. One such method, 
the statistical realisation of precisely what a “General Hospital Population” is, 
and how it may be presumed to differ from the nation at large, is at present 
impossible owing, in great measure, to obstacles which can be removed. This 
is now the opinion of a small minority; if it is ever adopted by a majority, those 
who, having the means of reform at their command, neglect to adopt them will 
find some difficulty in explaining their position. 


TABLE IL. 


Constants calculated from New Data (Select Cases). Healthy Organs. 
Males (25—55). 


Variable | Mean Standard Deviation | Seen 
Heart ... ke eee 11°96+ ‘14 ozs. 1°896 15°86 | 
Body-weight ... ee 129°82 + 1°80 lbs. 23 °605 18°18 
Kidneys ee Lee 10°67+ °16 ozs. 2-049 19°21 
Liver ... ae nae 57°39+ ‘86 ,, 11°263 19°63 
Age .., wat as 4127+ ‘66 yrs. 8632 | 20°92 
Brain ... ane wee) | 49°31+ °36 ozs. 4°558 9°24 
[“‘ Body-length”] ... (6899+ °20 ins. ] [2°548] | [3°69] 

| 
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We now pass to the results of our investigation and begin with the specially 
selected series, the chief constants of which are given in Tables IJ and III. All 
the mean values which can be compared with those set out in Table I differ 
significantly therefrom ; the heart is heavier, the other organs lighter than in the 
first series. It is doubtful whether the changes in relative variability of the heart 
and kidneys are significant, the difference in the latter case being only 2°41 + 1:12. 


TABLE IITA. 


Total Correlations. 


Heart | Body-weight| Kidneys Liver Brain pee) Age 
Heart | 1 | 54°04 | *564°05| 524-06] -08+-08] [-424-06 04+ -08 
Body-weight... | *65+°04 1 ‘37 +°07 574°05| °25+:°07 52+ 06] | —-03+°08 
Kidneys 56 + *05 374 °07 1 43+°06} ‘08+ 08 [-28+°07] | —-05+-08 
Liver 52+ °06 “OT + 05 43 + °06 1 13+ 08 [414-06] | —-16+4°-07 
Brain vee 308s 082 | S25 -ES OM "08 + ‘08 13+°08 1 [274-07] | —-08+-08 
[“Body-length”] [-42+:06]  [:52+-06]) [-28+-07] [-41£-06] [-27+-07] [1] [--20+-07] 
Age .. | 04+°08 | —'03+°08 705+ °08 | -"16+°07 |— 08+ °08 | [- -20+°07] 1 
TABLE IIIB. 
Partial Correlations. 
Variables “Constants” Correlation 

Heart and Body-weight Kidneys and Liver "49 + 06 

Heart and Kidneys Body-weight and Liver "41+ °06 

Heart and Liver Body-weight and Kidneys 13+ -08 

Body-weight and Kidneys Heart and Liver — 07 + 08 

Body-weight and Liver Heart and Kidneys 36 + ‘07 

Kidneys and Liver Heart and Body-weight 2107 


The liver, on the other hand, seems to be definitely more variable in the present 
series. Our finding, so far as the heart is concerned, confirms the remark of 
Pearson already cited. With respect to the correlations, those which have 
corresponding entries in Table I are larger than these, the coefficient involving 
the liver being significantly increased. Some of the other constants, while not 
matched in Table I, may be compared with the reductions of Blakeman* and 
Pearl+, viz. brain- and body-weights. Our mean brain is heavier than that 
deduced by Blakeman from the Middlesex Hospital data and near Pearl’s figure 


* J. Blakeman, ‘‘A Study of the Biometric Constants of English Brain Weights,” Biometrika, 
Vol. tv. p. 124, 1905. 


t R. Pearl, ‘‘ Variation and Correlation in Brain Weight,” ibid. p. 13. 
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for Bohemian cases. The mean body-weight is greater than Pearl’s figure and the 
coefficient of variation between that of Pearl and Pearson*. 


Since Pearl and Blakeman did not restrict their inquiry to persons between 25 
and 55 years of age, any comparison of our results with theirs must be of doubtful 
validity. Similarly we cannot control the constants involving age by comparisons 
with the work of other biometricians. In the first study, the correlation between 
age in years and “healthy” heart weight for males was given as ‘136 + 025 and 
from the same data we find in the case of females r=°155 + 029, in good agree- 
ment with the former value. Our present results cannot be said to differ signifi- 
cantly from the old values and we may quote a passage from the other memoir: 
“When we remember that the healthy heart is on the average much smaller than 
the heart in disease, and that sickness on the average increases continuously with 
age, we shall probably lay less stress on the general a priort idea that the weight 
of the adult heart increases very sensibly with age alone+.” We see no reason to 
modify this view ; further the recorded ages are likely to be subject to a con- 
siderable observational error in this class of material. 


The general conclusion therefore both of the present and the first study is that 
age alone is not proved to be sensibly correlated with the weights of the viscera in 
an adult healthy “General Hospital Population.” 


The correlations involving body-weight are the most interesting in our new 
series since, with the exception of Pearl’s results, they are so far as we are aware 
the first reductions of this variable in its association with the visceral weights 
which have been published. It may be freely admitted that the weight of a corpse 
must be so appreciably affected by the circumstances immediately preceding 
death that it cannot correspond at all closely to the weight during life. The force 
of this objection is diminished by the large percentage of violent deaths to be 
found in our series, since these cases would not be affected by the rapid wasting so 
often associated with a fatal illness. 


Chiefly for this reason, we are not without hope that the present calculations 
throw some light upon the true correlation between heart weight and body- 
weight}. One inference seems clear enough, viz. that no opinion ought to be 
ventured as to whether the weight of a heart is or is not normal without taking 
into account not only that of the other organs but also the body-weight. This 
leads to the question as to whether a formula of the nature of a regression 
equation might appropriately be employed to connect heart weight with other 
variables. In selecting variables upon which to base a prediction of the heart 
weight, the ideal is that each member of the set shall be highly correlated with 
the heart and that the correlation of the chosen variables one with another shall 
be low. For the purposes of a first trial, we decided to found the prediction 


* K. Pearson, ‘‘ The Chances of Death, etc.,” Vol. 1. pp. 298, ete. 
+ Greenwood, op. cit. p. 69. 
+ It is also to be remembered that there must be an appreciable post mortem loss of weight owing to 
evaporation in the cold chamber. 
61—2 
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upon the weight of the whole body and that of the kidneys and liver. The 
required constants having been determined* we find: 


2, = "03692, + ‘3178, +: 0192%, + 26715 <.cnseseseeee (1), 


where 2, is the weight of the heart in ounces, 2, the body-weight in pounds, « the 
weight of the kidneys in ounces, a, the weight of the liver in ounces. 


The standard deviation of the prediction is 1:235 ozs., giving a mean error of 
96 02. 


The value of such an equation depends upon the nature of the regression t 
which cannot be properly determined when the data are too few or too scattered 
to allow of grouping. A rough empirical test is to see how far the predicted value 
agrees with the known weight in a sample of the material. Blakeman applied this 
test in his series of brain weights. We took every third case in our selected series, 
determined the probable weight of the heart from the above equation and com- 
pared this with the recorded weight. The mean absolute error in the whole 26 
was ‘98 oz., which is in reasonable agreement with our expectation. We then 
asked ourselves whether the prediction could be improved. 


In the Tables II and III we have recorded, within brackets and inverted 
commas, some constants of “ Body Length.” Inquiry has elicited the fact that the 
“body length” was not true height at all but the length from the crown of the 
head to the end of the toes and had been recorded for the purposes of the under- 
taker. Since this measurement will be greatly affected by the degree to which 
the toes are flexed when rigor mortis sets in it does not necessarily bear a constant 
proportion to the stature, as was readily seen on examination of a short series of 
cases upon which both measurements had been made. Consequently the mean 
and standard deviation of this measurement are of no interest; but, since we found 
it to be substantially correlated with other measurements, it seemed just possible 
that its inclusion in an equation would improve the prediction. We found: 


a, = 0356a, + °3034a; + ‘01660, + 04184, 4+°2518 20... (2), 


where a, = heart weight in ounces, «= body-weight in pounds, #;= kidneys in 
ounces, x, = liver in ounces, 7; = “ body length” in inches. 


Applying this equation to the 26 test cases as before, the mean error proved to 
be ‘97 0z., 1e. there was no sensible improvement in the fit. 


Lastly we considered the effect of only taking two variables, the body-weight 
and the weight of the kidneys. The equation was: 


@, = O41 3a, + °3449a, 4+ 29187 .......ccccecccecoseewns (3), 


* In determining the various regression constants six places of decimals were retained in the 
coefficients of total correlation, i.e. we did not use simply the values tabulated in this paper. The 
work was arranged on the plan introduced by Yule, but several of the constants were checked by 
re-calculation, using the minors of the fundamental determinant in the ordinary way. 

+ An analysis of the data in Table VI of Greenwood’s “ First Study” shows that in the case of 
‘‘healthy ” organs the regression of heart weight upon kidney weight was effectively linear. For that 


case, N=413, r=:4004, »=-4174, and 


JN 
*67449 x iN pe —7r2=1°:776. 
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where a, is the weight of the heart in ounces, x, the weight of the body in pounds, 
x, the weight of the kidneys in ounces. 

(3) applied to the 26 cases gave a mean error of 1 ounce. 

We accordingly conclude that the weight of the heart may be predicted from 
that of the body and kidneys with an average error of some 8 per cent. of the 
mean weight of the heart, and that the introduction of the liver does not appreciably 
improve the accuracy of the result. This equation may be of use occasionally in 
the post mortem room; rough as it is, we do not think any better formula has been 
found by the anatomist. 

We finally come to the results yielded by the whole of the data sent us which 
fell between the required limits of age. 

The necessary constants are set out in Tables IV and V A and B and these 
should be compared with Tables II and III. It will be seen at once that, 


TABLE IY. 
Constants calculated from All new Data. Males ae 5): 


Variable Mean Standard Deviation | Covttcient of 
ariation 
Heart . re 1215+ °18 ozs. 2-700 22°22 
Body- weight . a 128°89 +1°64 lbs. 94°367 18°91 
Kidneys See acer tl 10°77+ ‘15 ozs. 2°267 21°05 
Liver ... ae we By/Pilvae Mell 11°890 20°82 
Age... ie ae 42°60+ ‘60 yrs. 8°844 20°76 
Brain ... ts ba? 49'10+ °30 ozs. 4°464 9:09 
TABLE VA. 
Total Correlations. All new Data. Males (25—55). 
] 
| Heart Body-weight i Kidney Liver Brain | Age | 
| 
i} 
| —— = — | — i| a 
| Heart wee 1 D6 + °05 | D3 4°05 464°05, ‘O9+°07) ‘06+4°07 
| Body-weight... | *56+°05 “40 + ‘06 ‘iB7 +:°05 244-07 |- 044-07 
Kidney a 563+ °05 ike +06 1 32+ °06 10+°07 | -04+°07 
Liver ate "46+ °05 ‘57405 *32+4°06 1 14+°:07 |-°11+:°07 
| Brain ies ‘09 + ‘07 24+ °07 *10+°‘07 14+°07 1 — ‘08+ :07 | 
| Age -- | O64°07 | —°044°07 | 044°07 |-°114 07 |— 08+ °07 1 
ip Ae. 7 | | 
TABLE VB. 
Partial Correlations. 
Variables Constants Correlation 
| 
— == | — | 
Heart and Body-weight Kidneys 454°05 | 
Heart and Kidneys Body-weight “40 + -06 
Kidneys and Body-weight Heart | 115+ °07 
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excepting the variability of the heart, no appreciable alteration can be discerned. 
Such changes as there are, e.g. the increased variability of the heart and its 
lessened correlation with the other organs, point to the first selection as being 
somewhat more healthy than the whole series, but the differences are evanescent. 
The regression equation of heart upon body and kidney weight as determined from 
the whole series is: 


a, = "04620, + °42600%.+ 16143 .......ceeeeesencecoeees (4), 


where a, = heart weight in ounces, #, = body-weight in pounds, x, = kidney weight 
in ounces. 


The mean error for the 26 test cases is 1°] ounces. 


We think the inference to be drawn from this comparison is that, in order to 
arrive at the values of the true post mortem constants descriptive of a healthy 
“General Hospital Population,” the process of selection need not be very stringent, 
so far as the diagnosed cause of death is concerned. 


We also think the whole tendency of the present investigation is to suggest 
that the constants here published are, within the limits indicated by the errors of 
sampling, representative of a healthy “General Hospital Population.” None of 
our results are in flagrant opposition either to the findings of observers equipped 
with more accurate data or to those previously obtained from hospital series. We 
hope, therefore, to have added another item of use to the student of a “General 
Hospital Population.” Of course, being restricted to a single type of measurement, 
this study is of but limited interest. Shrubsall has published* the results of an 
inquiry into certain anthropological characters of a “General Hospital Population,” 
while Greenwood and Candy + have shown cause for believing that the fatality of 
certain common diseases, such as lobar pneumonia, may be quite different in this 
class from that obtaining in the community at large. These are other items in 
a catalogue which has yet to be completed. It may well be possible for a future 
synthesist to paint a fairly accurate composite picture of a section of our country- 
men which is of immense importance not only in itself but also as the substratum 
of medical instruction. At present, however, we must be contented with the task 
of collecting material, in particular statistical records. 


The general conclusions of this inquiry may be summarised as follows : 


(1) The mean weight of the heart in healthy adult males of the class from 
which patients in a London General Hospital are drawn is not far short of 
12 ounces, 


(2) The correlation between the weight of the heart and that of the whole 
body is not much less than ‘5. 


* F.C. Shrubsall, Brit. Med. Journ. Vol. 1. p. 1689, 1904; St Barts. Hosp. Reports, Vol. xxx1x. 
p. 64. 
+ Journ. Roy. Statist. Soc. Vol. uxxtv. p. 365, 1911. 
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(3) The weight of the heart can be deduced from the weight of the body and 
that of the kidneys by means of a linear equation with an average error of about 
eight per cent. 

(4) The weight of the body and that of the kidneys ought always to be con- 
sidered before deciding whether a given heart is abnormally large or small. 

(5) It is doubtful whether the weight of the heart is sensibly correlated with 
age in an adult population (of ages 25—55). 

(6) The weight of the brain is less closely associated with the body weight and 
that of other viscera than is the heart weight. 


APPENDIX OF MEASUREMENTS. 


Data for the Study of Visceral and Organic Correlations in Healthy Male 
Adults (25—55). 


Body- | [‘‘ Body- | Heart- | Brain- 
econ F Age weight Gane weight | weight Kidneys ayer Cause of death 
Se eee (Ibs.) (ins.)] | (0zs.) | (ozs.) (025) (z5-) 
19). || wali 1) 1120) 69 Ialesy 4) fails} De25 57°5 Fract. Skull 
68, 712 | 45 | 155 67 | 14°75 | 525 | 14:5 | 70 - a 
80, 712 | 38 138 69 13°5 | 52 9 42°5 | Abdom. Injury 
114, 712 4] WAG 67 10°5 49 9°5 50°5 Fract. Pelvis 
yesh, Ale) 48 132 67 14°75 | 50 12°5 575 Tetanus 
190, 712 | 50 136 68 13°56 | 46 Wigs) 64 Ruptured Intestine 
958, 712 | 52 126 68°5 | 10°5 | 52°5 10°75 | 47 Fract. Skull | 
20,711 | 43 112 68 95 | 52°5 11:25 | 68°5 | Bronch. Pneumonia 
23 Alero 157 71 115 | 49°5 9°5 74°5 | Appendicectomy [mothorax | 
Seale eA 128 67 12 48 NES 74 Traum. Rup. of Oesoph. Pneu- 
61, 711 25 95 67 10°5 49°5 10 | 48 Periton. Perf. Duoden. Ulcer 
133, 711 | 41 133 69 11°75 | 47°75 12°75 | 65-5 | Strang. Hernia Perit. 
144, 11 | 34 106 66 10 -— 95 | 57 Perf. Gastric Ulcer 
169, 711 | 54 138 70 14°5 = 13°5 90 Fract. Spine 
174, 711 | 53 145 67 12 — 9 | 48 Appendicectomy 
PASI LT 30 160 tii 11 545 9 52°5 | Accident 
Doom Blse|) 116 7 14 = 145 | 71:5 | Glioma of Brain 
374, 711 38 148 72 15 395 165 | 68 Anaesthetic Death 
554, 11 | 54 | 136 67°5 | 12:25 | 47 9°75 | 64 | Accident 
629, 711 35 119 70°5 | 11 50 8 | 63 Tetanus 
913, 711 46 128 72 12 54 9 | 59 Fract. Skull 
1197, 711 o4 100 66 9°75 | 43 | 85 | 34 Fract. Pelvis 
1934, 11 | 50 | 115 69 | 11°75 | 39°75 | 9:5 =| 49-5 . . 
1236, 711 | 49 121 68°5 | 13 50°25 «11 56°5 | Injury to Head 
56, 710 | 52 149 72 14:25 | 52 | 10°25 70 Fract. Base 
96, 710 34 168 72 13°25 | 47 9 755 Fract. Ribs 
99, 710 | 26 95 66 9 43 7 50 Corrosive Poison 
221,710 | 49 126 67°5 | 12°75 | 46°5 11 66°5 Fract. Base 
251, 710 | 47 108 69 95 | 51°5 9 55°5 | Fract. Ribs 
284,710 +36 188 71 13°56 | 48°5 115) | 79°5 Fract. Base 
308, 710 | 40 98 68 10 43 10° | 48°5 Tetanus 
403, 710 35 123 69 13 53 12°5 62 Fract. Skull 
501, 710 46 102 64 12 46°5 14:25 | 34 Fract. Pelvis 
522, °10 | 46 121 69 JO=De R53 85 | 46 Epilepsy | 
608, 710 33 170 70°53 | 135 | 45 12 60 Fract. Skull | 
672, °10 52 129 71 11°5 53°5 11 | 54°5 | Fract. Base | 
| 
\ 
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Data for the Study of Visceral Correlations—(continued). 
Case nee | raat ieee eee soy Kidneys | Liver Cause of Death 

Number (years) (Ibs.) (ins.)] (ozs.) | (ozs.) (ozs.) (ozs.) 

1047, °10 40 114 69 9°5 48 11°75 70°25 | Fract. Skull 

1048, 710 46 132 67°5 | 13 42°25 11°75 =| 57°5 Fract. Base 

1054, ’10 51 156 67 15°5 49°5 12°5 70 Fract. Ribs 

1159, 710 46 184 73 12°5 59 12°5 60 Shot 

1160, 710 32 217 76 19°5 59 13°25 74°5 3 

1172, 710 50 Wat 70 11°5 48 12°75 56°5 Fract. Ribs 

1190, 710 49 112 66 9 44 9°5 46°5 Accident 

226, ’09 33 128 — 11°5 44°5 10°75 | 41 Fract. Base 

294, ‘09 52 112 ql 9:25 | 53 11 45 Fall 

297, ’09 45 114 69 9°75 | 48°5 95 | 42 Accident 

298, 09 | 50 | 133 67 9°5 | 49 7 47 - 

299, ’09 50 127 71 14°25 | 47 13 66 * 

329, 09 | 52 105 70 11°75 | 53 9 40 Oxalic Acid Poison 

336, 09 | 29 | 121 69 | 10°75 | 545 | 12 55°5 | Epileptic Fit 

356, 709 | 48 108 69 11°5 49 9°5 50°5 Burns 

365, 09 | 30 115 70 11-5 =| 49 14 60 Tetanus 

383, 709 | 26 135 73 14°5 44 9°75 53°5 Stabbed 

449, ’09 33 136 74°5 | 14°25 | 47°5 11 59 Tetanus 

460, 709 | 43 122 72 11°5 | 50 11°25 | 64 Tetanus and Bronch. Pneumonia 
865, 09 | 49 105 69 10°25 | 43 8 36 Cut Throat 
1079, 709 | 48 127 66 13°5 | 56 11°5 53°5 Run Over 

1261, ’09 40 149 69 13 48°5 11 62°5 Accident 

109, 08 26 105 65 10°5 46 10°5 62°5 Epileptic Fit [Peritonitis 
145, 08 | 37 145 70 115 | 47 12 78 Anaesthetic Death, Appendic. 
151, 08 | 48 96 65 95 | 47 8 43 Epileptic Fits 

169, 08 | 35 105 64 12 44 8 50 Fract. Skull 

172, 08 | 41 | 126 70 | 14:5 | 48 11:5 | 60 apts 

186, ’08 25 130 70 11°5 52°5 12 60 5 . 

202, 08 34 108 66 9 48 6°5 42 Scalds 

207, 08 | 41 107 68 11 62°5 10 52 Fract. Skull 

218, 08 | 44 147 68 10 515 9°75 | 63 Periton. Rup. Duod. Ulcer 
221, 08 | 49 161 68°5 | 14°75 | 45°5 13°75 | 60 Fract. Base 

384, 708 45 144 66 13°5 46 11 69 Burns 

404, 08 | 32 136 65 12 52 13 — Caught in Machinery 

472, 08 | 26 148 73 11 55°5 10 57°5 | Accident 

505, 708 | 35 146 70 13 58°5 10 60 Opium Poisoning 

542, ’08 26 107 67 10°5 45 8°5 44°5 Fract. Skull 

573, 708 | 35 169 71 14 — 14 73 Pulm. Emb. after Operation 
680, 708 | 46 118 67 11 43 85 44 Fract. Spine 

709, ’08 26 122 67 10°5 55 12 49°53 Perf. Duod. Ulcer 

751, 708 50 110 65 12 53°5 10 53 Tetanus 
1014, 708 50 109 70°5 | 11°25 | 46 9 51°5 Fract. Spine 

1104, ’08 32 116 70 11 55 12°5 58°5 Stovain Poison 


* Fract. Skull 19. Other Fractures 11. 


Classification of Causes of Death in above Data. 


Accidents, Suicide and Homicide 


Intestinal Injuries and Post- Operative Effects 


Tetanus 
Poisoning 
Epilepsy 


Glioma of Brain ... 
Bronch. Pneumonia 


“ Accidents” 11. 
Burns and Scalds 3 


49* 
13 


Se yw wT 


79 


Homicide 4, Injury to Head 1. 
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Other cases within the same limits of Age but excluded from the first Analysis. 


Body- 


Number |(years) pay 


97,’09 | 50 | 166 
1244, 09 | 52 | 123 


* Shght Hypertrophy. 
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|| ? Hyperpiesis. 


[‘‘ Body- Brain- 

length” weight | Cause of Death 

(ins.)] (ozs.) | 

74 48 4:25 | Ruptured Aneurism 

71°5 55 1 | Ruptd. Gast. Art., Chr. Gast. Ulcers 
71 48 9 | Ruptd. Cong. Aneur. of Cereb. Artery 
74 — 9 Anaemia and Scars of Gast. Ulcers 
65 48 13 Fract. Base (? Hyperpiesis) 

69 48 15°5 | Operation. Cirrhosis of Liver 

72 51 11°75 Operation. Enterectomy. Peritonitis 
65 45 8 Fits. Anaemia 

66 — 8°5 Carcinoma of Oescphagus 

71°5 — 19°5 Heart Failure ; Hyperpiesis 

71 54°5 | 10 Rupt. of Cereb, Aneurism 

69 — 10°25 B. Pna. Cerebral Tumour 

68 39°5 7 Rupt. of Aneurism 

69 47 11°5 Accident 

70 — 9 Carcinoma of Stomach 

72 ~ 8°75 Chron. Nephritis 

70°5 48 14°5 Diss. Tuberculosis 

69 45°5 | 11 | Cerebral Haemorrhage 

72 = 12 | Rupt. of Aortic Aneurism 

69°5 49 8°75 ? Hydrocyanic Poison ; ? Epilepsy 
69°5 48 9°25 | Fract. Base 

+ Brown Atrophy. { Fatty Degeneration. § Hyperpiesis. 


S| Hyperpiesis. 
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THE MEASUREMENTS OF THE PELVIS, WITH SPECIAL 
REFERENCE TO OBSTETRIC PREDICTION. 


By D. H. DE SOUZA, Assistant Physician to the Westminster Hospital. 
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(1) Introductory. 


The publication by Emmons* of a number of measurements of the female pelvis, 
carefully taken by himself, has afforded us an opportunity of investigating, by 
modern statistical methods, the relationship of the diameters to one another and 
to the bone measurements. As far as we can ascertain this has not previously 
been attempted. The subject has interested us not only from the statistical and 
anthropological sides but especially on account of its importance in practical 
obstetrics. Some of the diameters of the female pelvis can easily be measured 
on the living subject, others not at all, or only with great inconvenience to the 
patient. Unfortunately the two very important diameters at the pelvic brim, 
the obstetric conjugate and the transverse diameter, fall within the latter category. 
Empirical rules have been given, for example, to determine the obstetric conjugate 
from the diagonal conjugate, a diameter comparatively easily measured, but these 
lack both mathematical treatment and adequate statistical support. We have, 
in the following paper, considered methods of calculating these two important 
diameters from other pelvic measurements more readily obtainable. After making 
due allowance for moisture and soft parts we have left our results in a form which 
can be applied to the measurements as taken from the living subject. 


* Biometrika, Vol. 1x. p. 34, 1913. 
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Material. Emmons measured 217 pelves of the American Indian squaw in 
different museums in the United States of America. Our calculations have been 
made from these measurements; 216 pelves were considered in most cases, No. 115 
in Emmons’s Table being omitted as the diagonal conjugate was not stated. In 
calculations involving the pubic height only 212 were available, as this measure- 
ment was not given in Nos. 13, 72, 79, and 132 in his Table. 


For the definitions of the different diameters and the way in which each was 
measured reference must be made to Emmons’s paper. The names used by him 
have been retained with one exception. The term oblique diameter, as it has more 
than one significance, has been replaced by diagonal conjugate. The latter name 
is that usually given in this country to the diameter in question (sacro-subpubic) 
and is less likely to cause confusion. 


Two modern English pelves, one male and one female, and two ancient 
Egyptian female pelves, kindly lent by Prof. Thane and Dr Derry respectively, 
were measured in the dry and moist state to ascertain differences in the diameters 
under these conditions. The measurements of these pelves and of an ancient 
Egyptian male pelvis, the property of Prof. Pearson, were employed to test the 
formulae calculated from the American Indian pelves. 


For the numbers showing the thickness of the soft parts over the hip bone we 
are indebted to Dr Derry. They were obtained from measurements taken on 
20 post mortem subjects. The maximum inter-cristal diameter was measured 
with callipers. As will appear later, a knowledge of the height of the hip bone 
may be of service in calculating diameters. This height, however, would vary in 
the living subject according to the position of the pelvis. If a woman be seated 
on a table it is easy to measure the maximum height of the iliac crest above the 
table. This could readily be done by the obstetrician if the measurement proved 
useful to him. It was therefore decided to consider this a measure of the height 
of the hip bone in the moist condition and covered with tissues. With the thigh 
of the corpse flexed to the sitting posture and the callipers pressed from above on 
to the iliac crest and from below on to the ischial tuberosity, this height is obtained. 
Dr Derry soon found, however, that he got the same result if he took the measure- 
ment with the subject supine, one limb of the callipers being pressed down from 
above on to the iliac crest as before, the instrument crossing in front of the groin, 
and the other limb passing between the thighs to reach the ischial tuberosity from 
below. Dr Braxton Hicks has confirmed this independently on three subjects. 
Accordingly most of the measurements have been taken in this manner. 


The thickness of the soft parts was measured by means of an instrument with 

a rounded blade about as thick as a coarse darning needle but with not so fine a 

point. This was fitted into a hollow handle into: which it could be pushed against 

a spring. A graduated scale showed the length of needle protruding from the 

handle. When in use the blade was passed through the skin and soft parts till 

its point was in contact with the bone. The handle was then pushed firmly 
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against the skin so that the only part of the blade protruding from it was that 
extending from the skin to the bone. The length of this was read off and gave 
the thickness of the soft parts covering the bone. This was measured over the 
ischial tuberosity, and over the iliac crest in vertical and horizontal directions. 
The first two measurements, subtracted from the measurement already obtained 
for the height of the hip bone covered with tissues, will give the height of that 
bone in the moist state. By subtracting the thickness of tissues over the ilium 
in a horizontal direction from the maximum inter-cristal diameter previously 
measured, the length of that diameter in the moist pelvis free from tissues can 
be ascertained. 


The instrument was designed and made for us by Mr H. E. Soper of the 
Biometric Laboratory. 


(2) The relationship of the diameters and bone measurements to one another. 


The relationship of the diameters to one another was studied by means of the 
coefficient of correlation. As a preliminary, the average length and variability, 
us measured by the standard deviation, were found for each one. Emmons has 
already given the averages. Our results differ from his in some cases, probably 
owing to the fact that we divided the data into a larger number of groups. The 
diameters and bone measurements considered, together with the average and 
standard deviation of each, are shown in the following table. For the purpose 
of comparing the variabilities, the coefficients of variation are also given. All 
measurements in this and subsequent tables are expressed in centimetres. 


TABLE I. 
Coefficient 
avenge | landed | St 

Variation 
Inter-cristal ... =P en 25°77 4°07 1°44+ 05 5°57+°18 
Inter-spinous nee oe 22°66 + :08 1°65 + ‘05 728+ 24 
Transverse (brim) ... aes 12°97 + :08 "72+ 02 5°56 +°18 
Diagonal conjugate eae 12°14+ °05 1:02 + :03 8°44+ °28 
Obstetric conjugate ee 10°32 + °05 1°10 + °04 10°62 + °35 
Antero-posterior (outlet) ... 11°61 + °04 90 + 03 7°76 4°25 
Inter-tuberal eae ae 9°78 +'04 -90 + 03 9:22 + °30 
Posterior sagittal ... Au 7°56 + 04 "82+ 03 10°89 + °36 
Pubic height isle ae 2°81 + :02 52+ 02 18°30 + *62 
Right hip—height ... ..., | -19°264-05 | 1:02+-03 BQ7+°17 
» yy breadth . | 14°524-04 87 +03 597 +19 
Left hip—height ... me 19°32 + °05 1:03 + ‘03 5:32+°17 
aatss breadth ... baa 14°49 + 04 *83 + °03 5°72 4°19 
Sacrum—height ... ee 10°00 + ‘05 115+ :04 11°52 + °38 
" breadth ... ABE 11°47 + °03 66 + 02 5714-19 


The pubic height is the most variable of all the measurements. Of the 
diameters the transverse and inter-cristal show the least variability. The height 
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and length of the hip bone and the breadth of the sacrum have coefficients not 
very different from those of the transverse and inter-cristal diameters, and the 
height of the sacrum has a variability just about double that of any other bone 
measurement. The results are not surprising. Apart from the variation in a 
measurement due to individual differences, there are other causes for variation 
in the case of these measurements taken on old pelves. It is not always easy to 
obtain accurate measurements, especially of the diameters, and after death parts 
of the bone get worn away. The wearing away of the pubic crest, for example, 
and the difficulty which may be experienced in determining the lowest point of 
the symphysis, must be accountable to a great extent for the large variability 
of the pubic height. The height of the sacrum, being dependent on the number 
of vertebrae united to form that bone and on the integrity of the tip, which is apt 
to get worn, is more variable than the breadth of that bone and than the height 
and breadth of the hip bone. The extremities of these last are rounded and less 
likely to be worn. The breadth is a little more variable than the height, and 
experience proves that, of the two, it is the more difficult to measure accurately. 

The diameters must depend on the same factors as the bones, the individual 
differences, wear and tear, and difficulty of getting accurate measurements. The 
last especially must be of importance here. The obstetric conjugate, for example, 
depends upon the position of the sacral promontory, and therefore on the number 
of vertebrae forming the sacrum and on the proper fitting together of the bones, 
on the integrity of the upper and posterior part of the pubic bone, and on the 
ease or difficulty with which its anterior extremity can be determined. How 
much of the variation is due to each of these factors it is impossible to say. In 
such a diameter as the transverse there is little likelihood that the parts of the 
bone forming its extremities will be much worn away, but it would be unwise to 
go beyond a general statement of this kind. 

The pubic height and the height of the sacrum vary more than any other 
measurement of a single bone with which we are acquainted. The measurements 
of the mandible come next in order. Their coefficients of variation, as given 
by C. D. Fawcett*, are, for the greatest height 9°93, for the greatest width at 
the condyles 7°46, and for the greatest width at the angles 7°62. The variations 
in the lengths of the long bones in different races (found for the French and 
Aino by Lee and Pearson+, for the Naqada by Warrent, and for other races by 
Pearson§) have been compared with those of our measurements. The coefficients 
have as their limits 417 and 7:00, most of them being less than 5°5; hence the 
measurements of the pelvic bones, other than the pubic height and the height 
of the sacrum already considered, although in some few cases varying to about 
the same extent as the lengths of the long bones, have, on the whole, a tendency 
to be more variable. 

* Biometrika, Vol. 1. p. 408, 1902. 
+ Proc. Roy. Soc. uxt. p. 343, 1897. 


t Phil. Trans. Roy. Soc. Series B, cLxxxtx. p. 135, 1897. 
§ The Chances of Death, Vol. 1. p. 256. London, 1897. 
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The coefficients of variation of the diameters were compared with those of the 
other recorded measurements which include more than one bone. The coefficients 
of the greater number of the skull measurements* lie between 3 and 5. There 
are a few which are more nearly equal to those in Table I. Some of the measure- 
ments of the face, for example, have coefficients lying between 6 and 8, and that 
of the breadth of the palate is 9°29. The diameters of the pelvis are, however, 
more variable than the bulk of the cranial measurements. 

The average bone measurements given in Table I are also of interest. The 
asymmetry of the female pelvis has been pointed out from time to time by 
anatomists and has been strongly insisted upon by Hasse+. According to him 
the height of the left side of the pelvis is greater than that of the right, while the 
breadth is less. The numbers in the table bear out this statement, but, while 
Hasse makes these differences about 1 cm. each, the table shows them to be 
‘06 and 03 cm. respectively. In this connection it may be noted that, in the 
20 subjects measured by Dr Derry, the average height of the left hip bone in 
the moist condition was "15 cm. greater than that of the right. Of the three dry 
female pelves used by us for testing purposes one followed Hasse’s rule but the 
others varied, so that the average of the three showed the height of the right hip 
bone to be ‘05 and its breadth ‘04 cm. greater than that of the left. The number 
of negative results, though worthy of record, is, however, too small to place against 
the positive findings. 

The coefficients of correlation were next found for every diameter taken with 
every other diameter in turn. The results are summarised in the table. 


TABLE II. 
Inter- | Inter- | Trans-| Diagonal | Obstetric | Antero- | Inter- | Post- | Pubic 
crests | spines | verse | conjugate | conjugate | posterior} tubers | sagittal | height 
Inter-crests ... ae 1 “719 ‘61 "24 Bilzy “30 10 ‘ll 28 
+:°02 | +°03 +°04 +04 +°04 | 4°05] 4:05 | 4:04 
Inter-spines ... re eaal| aaio 1 52 "16 13 18 07 13 "22 
+02 +03 +04 +°05 +:04 | 4°05] +:05 | +:04 
Transverse... ma ‘61 52 1 09 ‘07 35 By 17 10 
+°03 | +°03 +05 + 05 #°04 | +°04] +:°04 | +:05 
Diagonal conjugate ... | *24 16 "09 1 91 31 "20 ‘21 14 
| +04 | +°04| 4°05 OL +:°04 | +°04) 4°04 | 4°05 
Obstetric conjugate... | 17 13 | -07 “91 1 30 "19 21 05 
| +°04 |) +°05 | +-05 +01 +°04 | +04] 4°04 | 4:05 
Antero-posterior ... | °30 18 “35 “31 “30 1 32 “49 07 
| +:04) 4:04} +°04 +:°04 +:04 +:04] +°03 | 4:05 
Inter-tubers vee, | | GLO ‘07 “32 20 ‘19 32 1 36 — ‘06 
| #:05 | +°05 | +°04 +04 +:°04 +:04 +:°04 | +:°05 
Post-sagittal ... ve | cll 13 17 21 21 49 36 1 004 
+°05 | £:05 | +°04 +°04 +04 +:03 | +:04 +:°05 
Pubic height vais 28 22 10 14 05 ‘07 — 06} -004 1 
+:°04 |} 4°04} +°05 +°05 +:°05 +°05 | 4°05 | +:°05 


* Fawcett, C. D., loc. cit. 
+ Archiv fiir Anat. p. 244, 1891. Ibid. p. 1, 1910. 
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As the obstetric conjugate and the transverse diameter at the brim have to be 
obtained in practice not by direct measurement but by calculation, and as a know- 
ledge of them is important and often necessary, special attention should be paid 
to these two diameters. The highest correlation (91) exists between the obstetric 
and diagonal conjugates, and this justifies the reliance always placed by the 
obstetrician on the use of the latter to determine the former. The obstetric 
conjugate shows a moderate degree of correlation (30) with the antero-posterior 
diameter at the outlet, but the coefficient is small between it and each of the other 
diameters, being very low for the transverse diameter (07) and pubic height (-05). 
The diameters with which the transverse diameter at the brim is most highly 
correlated are the inter-cristal (61) and inter-spinous (52). Next in order are 
the antero-posterior at the outlet (35) and the inter-tuberal (32). With the 
other diameters the correlation is low. 


In view of the fact that the pubic height forms one side of a triangle, of which 
the obstetric and diagonal conjugates form the other two sides, it is sometimes 
taken into consideration in calculating the obstetric conjugate from the diagonal 
conjugate. Thus, according to Waldeyer*, “in order to obtain the obstetric 
conjugate from the diagonal conjugate 1°75 to 2 cm., on an average, must be 
subtracted, and, the more acute the angle between the diagonal conjugate and 
the symphysis, and the higher the symphysis, the more must be taken away.” 
It will be seen, on referring to the table, that the pubic height is not highly 
correlated with any diameter, and that with the obstetric and diagonal conjugates 
the coefficients are ‘05 and ‘14 respectively. 


In order to arrive at formulae for finding the diameters it is necessary to use 
the measurements with which they are most closely correlated. The coefficients 
of correlation of the obstetric conjugate and the transverse diameter at the brim 
with the bone measurements were therefore found, to see whether any of them 
would be high enough to be of service. These are now given : 


Obstetric conjugate Transverse 
Right hip—height ne 55 +03 *48 +04 
r 5, breadth ue "48 + 04 45 + 04 
Left hip—height ae 56 +03 “45 + 04 
» yy breadth - ‘51 +-03 48 +04 
Sacrum—-height nee 314 °04 21+ °04 
% breadth one 387 4:°04 56 +03 


All these measurements are more highly correlated with the obstetric conjugate 
than is any diameter except the diagonal conjugate (91). With the transverse 
diameter the measurements of the hip bone show higher correlation than do any 
diameters except the inter-cristal (61) and inter-spinous (°52), and the coefficient 
of correlation of the breadth of the sacrum with this diameter (56) is exceeded 
only by that of the inter-cristal with it. Of these measurements, however, the 
only one which can easily be made with sufficient accuracy on the living subject 


* Das Becken, p. 49. Bonn, 1899. 
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is the height of the hip bone. This can be taken in the sitting posture as already 
indicated. Since it is one of the measurements most highly correlated with both 
the diameters under consideration, it will have to be taken into account in working 
out the formulae from which these diameters are to be found. Its coefficients of 
correlation with the diameters most closely correlated with the obstetric conjugate 
(diagonal conjugate) and the transverse diameter (inter-cristal and inter-spinous) 
will therefore be required. These have been calculated for both hip bones. 


Right hip— Left hip— 
height height 
Diagonal conjugate ... eo 63 + °03 64+ 03 
Inter-crests ... i ss 56 +08 57 +°03 
Inter-spines ... Be ie 444+ °04 46 +04 


Apart from the closeness of the figures on the two sides these results call for 
no particular notice. 


As a matter of anthropological interest the relationship of the bone measure- 
ments to one another was studied, and a table of the coefficients of correlation 
drawn up on a plan similar to that employed for the diameters. The correlation 
is greater between corresponding measurements of the hip bones on the two sides 
than between the height and length of the same bone. Thus the heights of the 
two hip bones bave a coefficient ‘98 and the breadths a coefficient ‘94, while that 
for the height and breadth of the right hip bone is ‘78 and for the same measure- 
ments on the left side ‘81. This closer relationship between corresponding 
measurements on the two sides than between measurements on the same side 
has been noted by previous observers for other bones*. The crossed correlation 
between the height of one bone and the breadth of the other is not very different 
from the direct correlation between the height and breadth of the same bone, 
eg. height of right hip bone with breadth of left bone gives a coefficient ‘79, 
height of right bone with breadth of right bone a coefficient ‘78, height of left 


TABLE III. 
Right hip— | Right hip— | Left hip— Left hip— } Sacrum— | Sacrum— 
| height breadth height breadth height breadth 
i _ 
Right hip—height ... | 1 ‘78 98 “79 "34 “59 
+:°02 | +:°002 + 02 +04 +°03 
Right hip—breadth ... | ‘78 1 eeu 94 31 “BD 
+ 02 +°02 + 005 +04 + °03 
Left hip—height ... | 98 77 1 81 37 61 
; 2#:002 | 302i] +:°02 +04 +03 
Left hip—breadth ... “(h) 94 “81 1 | 30 58 
+°02 +005 +°02 +04 +:°03 
Sacrum—height aes 34 | 31 37 30 1 32 
+:°04 | +#°04 | +:°04 +°04 | +:°04 
Sacrum—breadth ... “59 55 61 58 | 32 | 1 
+:°08 +°03 +03 +:03 | +:04 


* Lewenz and Whiteley, Biometrika, Vol. 1. p. 345, 1902. 
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bone with breadth of right bone a coefficient ‘77, and height of left bone with 
breadth of left bone a coefficient ‘81. Note also that there is no great difference 
between the coefficients of the corresponding measurements of the right and left 
hip bones with the sacrum (see first four numbers in the last row or column in the 


table). 


Of only one pelvic bone, the sacrum, have the coefficients of correlation of the 
measurements been recorded previously. Warren*, studying the bones of the 
Nagqada race, found that the coefficient of correlation for the height and breadth 
of this bone was ‘46 for 32 males examined and ‘31 for 45 females. The latter 
number is practically identical with our finding (‘32). The coefficients given by 
him to show the correlation of the measurements of the long bones with one 
another lie hetween ‘82 and ‘98 for the male, and ‘70 and ‘96 for the female, with 
the exception of that for the maximum length of the clavicle with the oblique 
length of the humerus, which is ‘68 for the male and ‘53 for the female. Lewenz 
and Whiteleyt, for the bones of the hand, found coefficients with a range “43 to 
‘90 for measurements of the same finger, and ‘59 to ‘95 for other measurements on 
the same hand. If we omit coefficients between corresponding measurements 
on the two sides our coefficients have a range ‘30 to ‘79, and, on the whole, are 
lower than those between the measurements of the long bones toyether and of 
the bones of the hand together, being nearer to the latter than to the former. 
They are, however, greater than those between the diameters of the skull which 
C. D. Fawcett} found to lie between -27 and ‘49 for the male and ‘12 and ‘28 
for the female. 

It is interesting to note here that the coefficient between the length and 
breadth of the skull is ‘34 in the male and ‘14 in the female, while that between 
the length and breadth of the pelvis at the brim, Le. between the transverse 
diameter and the obstetric conjugate, is ‘07, and at the outlet, i.e. between the 
antero-posterior and inter-tuberal diameters, °32. 

(3) The finding of the obstetric conjugate and the transverse diameter for the 
dry pelvis. 

The method adopted to determine the non-measurable from the measurable 
diameters was that employed by Prof. Pearson§ for obtaining the stature of an 
individual from the lengths of the long bones. The equation of the regression 
line was used for this purpose. In the equations which follow the lengths of the 
diameters are represented by letters as shown below: 


Obstetric conjugate C Inter-tubers J 
Transverse T Inter-crests K 
Diagonal conjugate D Inter-spines  S 
Antero-posterior <A Pubic height P 
Height of right hip R 
gr deft, 2 
* loc. cit. + loc. cit. +t loc. cit. 


§ Phil. Trans. Roy. Soc. Series A. Vol. cxcir. p, 169, 1898, 
Biometrika 1x 63 
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(A) Obstetric conjugate. Equations were obtained for deriving this from each 
of the diameters with which it is most closely correlated, viz. the diagonal con- 
jugate (-91), and the antero-posterior (30). These equations are 


CO =-978.D =1-553 #299. 1... ee (a), 
C=-365 A 46-0814 705 2.0.0.) ee (b), 


the first of which has a much smaller probable error (‘299) than the second (705). 
A single equation was also obtained, using both diameters 


C= 972.D +0238.A — 1748 +°298 «0.2. a eens nomena (c) 


with an extremely slight improvement of the probable error as compared with (a). 


The diameters next in order of correlation with the obstetric conjugate are the 
posterior sagittal (21) and the inter-tuberal (19). The posterior sagittal is not 
measurable on the living subject. The inter-tuberal was measured in a special 
way on the dry pelvis by Emmons and is not necessarily the same as the inter- 
tuberal diameter as understood by the obstetrician. The method is not applicable 
to the living subject, but this diameter was employed here to test whether it was 
of sufficient importance for its measurement to be followed up further. An 
equation was formed by taking it with the diagonal conjugate and the antero- 
posterior diameter 


C=972D + 023A + OOLL — 1-745 £:299 veeceeeeeeees (d). 


On comparing (c) with (d) it is apparent that the addition of the inter-tuberal 
diameter makes little difference, and that little to the bad, as is shown by the 
probable errors. This diameter will therefore not be considered further. 


It has been pointed out previously that the pubic height is sometimes used 
in calculating the obstetric conjugate but that its correlation with this diameter 
is low (05). The equation connecting the obstetric conjugate with the pubic 
height is 

C= 110P' + 10-011. 4-738) 225. eee (e), 


having the largest probable error (‘738) of the equations yet obtained. The result 
of combining the pubic height with the diagonal conjugate is, however, good : 


C= 989 D —"165P —1-229 4-203 eee (f) 


with a probable error lower than that of (c). This is due to the fact that, although 
the obstetric conjugate is more highly correlated with the antero-posterior diameter 
(30) than with the pubic height (05), the former of these two is more highly 
correlated with the diagonal conjugate (31) than is the latter (14), and it has 
been found that when the crossed correlation between the variables is high the 
resulting equation is not so good as when the crossed correlation is low. The 
pubic height, whether measured externally or internally, involves exposure of or 
inconvenience to the patient, so that it is desirable to avoid its use if just as good 
or nearly as good results can be obtained by other means. 
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Of the measurements which can be taken on the living subject, next to the 
diagonal conjugate the height of the hip bone shows the closest correlation with 
the obstetric conjugate (right °55, left °56). Equations were formed using the 
height of each hip bone 

C= 590 fs Oat OND) caveatiess sta reeten: (9), 


C6021 Or Ol Oe rcemnossiecanccsayse. e (h). 
The equations as tested by probable errors are better than ()) and (e) but not 
as good as (a), (c), (d), and (/), but if each of the measurements be taken with the 
diagonal conjugate the resulting equations are exceedingly good 
C= O08 D 049 HO fick 298 near cody sie ssele seas (2), 
C= 1000) 012 Li VOTE 298 sosces coacantamenee (7): 


The probable errors to four places would be ‘2976 and ‘2978 respectively while 
that of (c) is 2984, so that the probable errors are less than that of any equation 
except (f). These equations have a practical advantage over (/), for the height 
of the hip bone can be easily measured on the living subject whilst, as already 
mentioned, there are disadvantages attached to the measurement of the pubic 
height. Moreover, the measurement of the height of the hip bone is less 
objectionable to the patient than that of the antero-posterior diameter, so that 
equations (7) and (7) possess a practical advantage over (c). 


To facilitate reference all the equations for finding the obstetric conjugate are 
now collected together: 


(a) C= 978D—1:553 + -299. 

(b) C= 365A +6081 +705. 

(c) C= 972D +-023.A —1°743 + 298. 

(d) C= 972D+-0234 +0017 —1-745 + 299. 
(ec) C= 110P+10011 +:738. 

(f) C= 989D—-165P — 1-229 + 293. 

(g) C= 590R—1-087 +619. 

(hk) C= 602L—1:310 +°610. 


(i) C=1008D—-049R —-977 +298. 
(j) C=1:005 D—-042L—1-:075 + 298. 


Equations (/), (2) and (j) appear to indicate that, for a constant diagonal 
conjugate, the obstetric conjugate is smaller the higher the pelvic wall. 


These formulae have been tested for the original series and for five other 
pelves. The cards of 25 of the 216 pelves were picked out at random and the 
obstetric conjugate was calculated from the recorded measurements by means 
of the equations, with satisfactory results. The square root of the mean square 
deviation of the calculated from the recorded values of the obstetric conjugate was 
multiplied by 67449 and compared with the probable errors of the equations. 


The values calculated from equations (b), (g), and (h) fell within this probable 
63—2 
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error. The others exceeded it, but the greatest difference was only ‘08, i.e., since 
we are working in centimetres, less than 1 millimetre. 


The formulae were also tested for five pelves measured by us. Three of these 
were ancient Egyptian (I and II female, III male), and had to be fitted together 
to be measured, but the other two were modern English (IV female, V male), with 
the ligaments attached. The inter-tuberal diameter was not measured since it 
would have required the application of Emmons’s method, and this we were not 
prepared to carry out as it had already been found that, in the determination of 
the obstetric conjugate and the transverse diameter, nothing was gained by the 
use of this diameter. Equation (d) was therefore not tested. The differences of 
the calculated from the recorded values for these pelves are given below for each 
equation. 


TABLE IV. 

Pelvis Number... I II Il IV Vi 

Equation (a) ... 1°33 39 — 28 “15 — 64 
o (@) ace 83 — ‘76 BQ 2°65 33 
- (ec) ce || 1:85 “40 — '26 ‘19 — 52 
o (Q) Bee 44 —1:09 ‘07 2°00 -— ‘15 
3 GA) cca. eS “59 -— ‘01 28 — 32 
" (g) ... | 1:49 — ‘21 —'51 2°24 —1°35 
DU? NS 123 Om 5S 2-42 | —1°43 
5 @ee, Hae 7 yi — 23 ‘07 — 45 
¥ GD) ccae ll ees) 370 0-| — 123 08 — “46 


Perhaps the most interesting of these numbers are those for the modern 
English female pelvis IV. Here the results for the equations formed from the 
diagonal conjugate and the height of the hip bone, (2) and (j), are excellent, the 
differences between recorded and calculated values being only ‘07 and ‘08 cm. 
The equations formed from the diagonal conjugate singly (a), and the diagonal 
conjugate and antero-posterior diameter (c), are the next best, the differences 
being 15 and ‘19 cm. respectively, then that from the diagonal conjugate and 
pubic height (/), with a difference ‘28 cm. All the others have differences 2 cm. 
or more. 


The figures bring out strongly what our previous discussion on the probable 
errors has shown, namely that, excluding, for reasons considered, the equations 
formed with the pubic height, the best equations are those containing the diagonal 
conjugate singly or combined with the height of the hip bone or the antero- 
posterior diameter. 


The equations (2), (7), (a), (c) and (f) give differences less than a centimetre 
in four of the five pelves. No other equation does this except (b), but the 
difference for this in the remaining pelvis, which happens to be the modern 
English female pelvis IV, is 2°65 cm., so that the equations which give the best 
results on the whole are those which give the best results with pelvis IV. It 
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should be noted that all the equations give fairly good values for the male Egyptian 
pelvis III and most of them for the male English pelvis V. 

The best of the formulae obtained from the measurements of the American 
Indian female pelvis for finding the obstetric conjugate are therefore applicable 
to the modern European female pelvis, and will give good results even for the 
male pelvis. 

(B) Transverse diameter (brim). The inter-cristal (‘61), inter-spinous (°52), 
and antero-posterior (‘35), are the diameters most closely correlated with the 
transverse diameter. By taking them singly and combined, equations for finding 
this diameter were formed. Equations were also formed from the height of the 
hip bone taken singly or combined with the diameters. This bone measurement 
has proved serviceable in finding the obstetric conjugate and was used here, as its 
correlation with the transverse diameter also is fairly high (right “48, left °45). 
The following equations were obtained: 

(k) T='305K + 5-096 +°386. 

(i) T=:228S4+ 7-791 + -415. 

(m) T='284A +9°673 + °455. 

(n) T= 261K +0498 + 4864 + °385. 

(0) T= 221K + 0618S +-160A + 4044 + 374. 
(p) T= '341R + 6398 + °427. 

(q) T= '320L + 6794 + 433. 

(r) T=:248K +:144R + 3°799 +378. 

(s) T=-259K +-115L +4078 +381. 

(t) T='169S + ‘221K + 4883 + °392. 

(uw) T='1748 +1920 + 5323 + 398. 

(v) T= 202K +0508 +145 Rk + 3°835 + 376. 
(w) T= 217K +0478 +1130 4 4155 + 380. 

Of the equations formed by taking the measurements singly, namely (k) from 
inter-cristal, (1) inter-spinous, (m) antero-posterior diameter, (p) height of right 
hip bone, (qg) height of left hip bone, (/) has the lowest probable error, *386. This 
is slightly improved (385) by combining the inter-spinous with the inter-cristal 
diameter as at (n). The inter-cristal diameter combined with the height of the 
hip bone gives a still lower probable error (right °378, left °381), as is seen in 
equations (7) and (s), while (¢) and (w) show that the inter-spinous diameter and 
the height of the hip bone together give a higher probable error even than the 
inter-cristal diameter alone (i). By combining three measurements, however, 
there is noticeable improvement. Equation (v) formed from the inter-cristal and 
inter-spinous diameters and the height of the right hip bone has a probable error 
‘376 which is slightly lower than that of (r), from the inter-cristal diameter and 
height of right hip bone (378); and (aw), from the inter-cristal and inter-spinous 


498 The Measurements of the Pelvis 


diameters and the height of the left hip bone, shows similar improvement as 
compared with (s), from the inter-cristal diameter and height of left hip bone 
(‘381 and 380 respectively); while (0), from the inter-cristal, inter-spinous, and 
antero-posterior diameters, has the lowest probable error, ‘374. 

The best formula then for finding the transverse diameter from a single 
diameter is (k), from the inter-cristal diameter. For two measurements, (7) 
and (s), from the inter-cristal diameter with the height of the right and left 
hip bones respectively, are slightly better than (n), from the inter-cristal and 
inter-spinous diameters, and any of these can conveniently be used as all the 
measurements are easily taken. In the combination of the inter-cristal and 
inter-spinous diameters with a third measurement, the antero-posterior diameter 
gives a better equation (0) than the height of the hip bone, (v) and (w), but the 
measurement of the latter is less objectionable to the patient. 

These formulae were tested in the same way as those for the obstetric con- 
jugate. The transverse diameter was calculated from the recorded measurements 
of 25 of the 216 pelves, picked out at random. The square root of the mean 
square deviation of the calculated from the recorded values of the transverse 
diameter was multiplied by 67449 and compared with the probable errors of 
the equations. In every case they fell within this probable error. 

The formulae were also tested for the five pelves measured by us (I and II 
Egyptian female, III Egyptian male, IV English female, V English male). The 
differences of the calculated from the recorded values for these are given below 
for each equation. 


TABLE V. 

Pelvis Number... I II Ill IV Vv 

Equation (/) ... — 58 —1:04 -1:18 — ‘27 — 1°57 
3 (Ole — 99 — ‘87 —1°58 +'17 | —1:48 
5 (Mm) se — 1°30 -1:'138 — ‘21 +31 —1:08 
3 (n) ... — 33 -— ‘74 —1:06 +08 —1°30 
3) (O)ieeee — ‘51 — *93 —1°'16 +:°09 —1:40 
5 (p) .. — 1-00 — *86 — ‘86 — 04 —2°12 
5 (OO) 4 — 1-09 — ‘83 — °83 + 05 —2°'10 
(r) = Dl — ‘90 — 1:22 — *20 — 1:86 
; (s) = byl — ‘91 —1:22 —'18 —1°81 
5 (t) — "74 = id — 1°56 +15 — 1:95 
4 (w) — ‘81 — "77 — 1°45 +°20 —1:'90 
5 ) 0. — 52 - ‘87 —1°'37 —'12 —1°85 
. (Qo) es — ‘59 —- ‘91 | —1:38 -—'13 — 1°83 


Two things are at once obvious, (1) that the best result for every equation is 
given by the modern English female pelvis IV, (2) that the results for the two 
male pelves III and V are not very good. In the case of the equations (0), (v), 
(w), (7), (s), (n) and (k), shown to have the least probable errors, the differences 
for the three female pelves, I, II, and IV, are all less than a centimetre except 
that of (&) for pelvis II, which is a little more. The best results for the English 
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female pelvis IV are given by (p) and (q), but their results are amongst the worst 
for pelvis I. The table shows that the equations selected from a consideration of 
probable errors may be expected to give good results, that these formulae may be 
applied to the modern European female pelvis, and that they cannot be satis- 
factorily applied to the male pelvis. 


(4) The effects of moisture on the pelvic measurements. 


The formulae which we have obtained will give the obstetric conjugate and 
the transverse diameter for the dry pelvis. These will not necessarily be the same 
in the living subject for there the bones contain moisture and are covered by the 
soft tissues. For practical purposes it is required to know what etfect these 
differences will have on the measurements. 


The effect of moisture was studied by soaking a dry pelvis in water and 
comparing its measurements in the dry state with those taken after it had been 
in water for some time. This was done with two ancient Egyptian female pelves 
lent by Dr Derry. The bones of the pelves were measured, then fitted together 
and kept in position by strong elastic bands, and the diameters measured with 
eallipers. The pelvis was then taken to pieces and the bones immersed in water. 
After an interval they were taken out, held up to drip, and excess of water removed 
from them with a cloth. They were measured, fitted together again, and the 
diameters measured. This was repeated at several intervals. Finally they were 
taken out of the water and left in the air to dry for some days, and the bones and 
diameters were again measured. The results were unsatisfactory, as the following 
example, giving the measurements at different intervals, will show. 


TABLE VI. 
| | | . 
Pelvis I Dry | In water | In water In water | In water In water aoe oune 
2hours | 4 hours | 7 hours | 70 hours | 97 hours tA dave 
| | days 
5 ie | | 
Inter-crests... ate 22°30 22°52 22°45 22°39 | 22°82 23 °02* 22°79 
Inter-spines... ee 19°81 19°95 I) 38) 19°91 | 20°23 20°52 20°23 
Transverse... va 11°33) | 11°30 11°23 AV25 a) sleod 11°34 1 Ie 
Diagonal conjugate... 11°18 11°30 11°30 11:23 11-13 | 11:28 10°96 
Obstetric conjugate ... | 10°71 10°70 10°70 10°69 ! 2 2 
Antero-posterior «s. | 10°42 10°25 10°52 10°28 | 10°51 10°35 10°31 | 
Pubic height ... ace || 8 2°30 2°45 2-47 | ? ? ) | 
Right hip—height ... | 17°40 17°50 17°47 17:50 | 17:50 17°53* | 17°42 
Me breadth =... 14:29 | 14:20 14°15 14°17 14°16 14:15 14:03 
Left hip—height ea 17°60 17°75 17°60 WZPARO) 1) LA AShO; e637 17°50 
breadth: s.. 14:02 | 13°85 13°85 13°88 | 13°83 13°85 13°60 
Sacrum—height ss S728 8°80 | 8:86 8°87 8°87 8°70 
5 breadth —... 10°30 | 10°16 1017) | 10713 10°18 10°14 10°07 


Obviously the changes in the measurements are small and therefore not easy 
to detect where there are opportunities for error as in this case. The initial 


* Both hip bones were cracked at the crest. 
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difficulty of putting the bones together on different occasions so as to get constant 
measurements is increased under the conditions of the experiment. After the 
bones have taken up water the shape of their articular surfaces is altered; hence 
the bones do not fit together in quite the same way as they did before, and the 
measurements are not fully comparable. A difference in the tilt of the sacrum, 
for example, may affect the two conjugates and the antero-posterior diameter. 


But there is another difficulty. These old bones are very fragile and are apt 
to crack, as both hip bones did in this experiment. Flakes of bone come away 
from the surface. This happened at the upper part of the pubes in this pelvis, 
and is the reason why the record for the obstetric conjugate and pubic height 
is incomplete. Fortunately we were able to surmount these difficulties. 


Prof. Thane lent us two modern English pelves, one female and one male 
(previously referred to as IV and V respectively), with the bones still attached 
to each other by the ligaments. Both were measured in the dry state. Pelvis IV 
was measured after immersion in water for 22 and 68 hours, and subsequent drying 
for 39 days. Pelvis V was measured after 98 hours in water, and subsequent 
drying for 35 days. The final measurements were very nearly the same as the 
initial, so that the mean of the two was taken as a good measure of the diameters 
in the dry pelvis. In the case of the first pelvis (IV) the numbers after 22 and 
68 hours in water were also very close, and the mean of these was taken as the 
measurement in the moist state. The measurements in the dry state with their 
increase in the moist state are tabulated below. 


The negative sign in the fourth column indicates that moisture caused some 
diminution of the measurement in question. 


TABLE VII. 
Preiyis IV ? Prxtvis V ¢ 
Mean 
of 
| Differences 
Dry | Difference Dry Difference 
Inter-crests  ... Bie 26°26 10 26°54 | 18 14 
Inter-spines... ee 21°40 02 23°29 04 03 
Transverse... ee 15°85 06 11°63 ‘08 ‘07 
Diagonal conjugate... 14°14 | 15 12°70 ‘10 13 
Obstetric conjugate... 12°42 “00 10°32 02 ‘01 
Antero-posterior sae 10°11 13 10°71 03 08 
Pubic height... sco fC 22 4:20 21 "22 
Right hip—height —... 19-03 ‘07 21°55 05 06 
58 breadthis =. 15°78 06 15°87 — 07 — 01 
Left hip—height ee 18°79 eal 21°70 10 ‘ll 
sy) ee breadithiaeees Wry “05 15°87 ‘03 04 
| | 


The differences are small, the greatest being a little over 2 mm. for the pubic 
height. Moreover they are very close for the two pelves. The means of these 
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differences for the two pelves are given in the last column. In future calculations 
the numbers in this column are used to indicate the effect of moisture on the 
different measurements. It follows that, in the moist pelvis, all the diameters 
and the heights of the hip bones are slightly greater than in the dry pelvis. 


(5) The thickness of the soft parts. 


We have been unable to find any very satisfactory record of the thickness of 
the soft parts of the pelvis. The soft parts affect our calculations. An internal 
diameter such as the obstetric conjugate measured on the tissue-clothed pelvis 
must be less than it would be if measured on the moist bony pelvis, by the 
thickness of the tissues over the sacrum and pubes. An external measurement 
such as the height of the hip bone is greater on the living subject than on the 
moist pelvis by the thickwess of the tissues above the iliac crest and over the 
ischium. The thickness of the tissues which increase the height of the hip bone 
as well as that of the tissues over the ilium in the inter-cristal diameter have 
been most kindly measured for us by Dr Derry on twenty adult female post- 
mortem subjects in the manner already described. The details of these measure- 
ments will be found in a table at the end of this paper. The averages only are 
given here. 


TABLE VIII. 


Inter-cristal diameter... ues wee ae 28°63 
Thickness of tissues in this diameter vee be 
Ischio-iliac diameter ase Hes : Sas 
Thickness of tissues above crest of ilium ae ae 
Thickness of tissues over ischium a ee be 


The inter-cristal diameter on the living subject is therefore greater than that 
of the moist bony pelvis by °70 cm., that being the combined thickness of the 
tissues on the two sides. The ischio-iliac diameter, Le. the height of the hip, 
is greater on the living subject by *47+°49 or ‘96 cm. on the right side, and 
‘474+ 47 or ‘94cm. on the left side. 


The tissues over the promontory of the sacrum ‘and the posterior surface of 
the pubes were not measured. They are almost negligible. Two diagrams drawn 
to scale and most generously placed at our disposal by Prof. Symington show the 
differences for a thin and fat subject. In Fig. 1, Plate XXVIII, the median 
section of a rather thin woman 35 years old, the tissues over the promontory 
measure 2mm. while those over the pubes cannot be measured on account of the 
distended bladder. In Fig. 2, Plate X XIX, showing the left half of the median 
section of the pelvis of a rather fat woman 30 years old, the tissues over the pro- 
montory measure 6 mm,, over the pubes 4mm., i.e. less than over the promontory. 


Biometrika 1x ae 
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These numbers, however, do not give a good idea of the thickness of the tissues in 
practice. The tissues would be compressed by the parts of the foetus, and, being 
composed of loose connective tissue, would take up much less room. If an 
attempt be made to measure them in a compressed condition the measurement 
is so small as to be almost negligible, and the chances of error are great. In 
what follows 1 mm. has been considered a just estimation of the thickness of 
the tissues over the surfaces of these bones under these conditions. The tissues 
will therefore diminish the diameters, as measured on the moist bony pelvis, 
by 2mm. for the obstetric conjugate, diagonal conjugate and antero-posterior 
diameter, and will increase the pubic height by 1 mm., if measured internally. 


With regard to the transverse diameter at the brim 2 mm. will also be a good 
allowance. Just at the brim, where this diameter is measured, there is nothing 
but loose tissue. The external iliac vessels and the psoas muscle are well above 
the brim, and it seems hardly likely that they can be pushed over it so as to 
diminish the transverse diameter. 


(6) The finding of the obstetric conjugate and the transverse diameter for the 
living subject. 

All the necessary information has now been collected for finding formulae to 
determine the obstetric conjugate and the transverse diameter in the living 
subject. These formulae can be obtained from the equations previously given by 
allowing for the effects of moisture and the soft parts. The same letters are used 
to indicate the lengths of the diameters, but are printed in clarendon to denote 
the length in the living subject. Consider the obstetric conjugate in the living 
subject. It is smaller than that of the moist bony pelvis by the thickness of the 
tissues over the bones, 2 cm., hence, if this thickness be added to it, we get the 
obstetric conjugate of the moist bony pelvis. This again is larger than that of 
the dry pelvis by ‘01 cm., the increase due to moisture. If ‘01 cm. be subtracted, 
therefore, the obstetric conjugate is obtained for the dry pelvis. This is expressed 


by the equation 
C+ :20-01=0. 


The same argument applies for the diagonal conjugate and antero-posterior 
and transverse diameters, the equations for which are 
D+ ‘20-13 = D, 
A+°20-‘08=4, 
T+ 20-07 =T7. 


With the inter-spinous and inter-cristal diameters, the pubic height, and the 
height of the hip bone, the case is different. The thickness of the soft parts 
makes the last three greater in the living subject than on the moist bony pelvis 
and, to obtain them for the latter, this thickness must be subtracted. It does not, 
however, affect the inter-spinous diameter for which therefore no correction for 
soft parts is required. The measurements of the moist bony pelvis are, as with 
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the other diameters, larger than those of the dry pelvis, so that, to obtain these, 
the increase due to moisture must also be subtracted. The equations are 


K—‘70-—'14= K, 

S-— 0--03=S, 

P —‘10—:22=P, 

R—-'96-'06= Rh, 

L-—94-‘1ll=Z. 
If these values for C, D, A, T, K, 8S, P, R, and LZ be now substituted in the 
original formulae (a) to (w), the new formulae produced give the measurements 


on the living subject. In the following list these are also lettered (a’) to (w’) to 
indicate the original formula from which each was derived. 


Obstetric conjugate. 
(av) C= ‘978D —1°675 + °299. 
(b.) C= 365A + 5935 + °705. 
(’) C= ‘972D+4 023A — 1°862 + 298. 
(e) C= ‘110P + 9:785 + ‘738. 
(f’) C= 989D— ‘165P —1:297 + °293. 
(7) C= 590R—1:829 +619. 
(h’) C= ‘602L — 2:132 + ‘610. 
(i) C=1008D — -049R — 1-047 + -298. 
(7) C=1005D— 042L—1:151 + :298. 
Transverse diameter. 
(k’) T= 305K +4710 + 386. 
(l’) T= 2288 +7654 + 415. 
(m’') T= ‘28444 9°577 + °455. 
(v') T= :261K+ 0498 + 4513 +385. 
(o) T= -221K+ 0618 +-160A +3°747 +374. 
(p’) T= 341R+4 5920 + 427. 
(7) T= 320L + 6328 + -433. 
(’) T= '248K+ 144R+3°313 + 378. 
(s') T= :259K+ ‘115L 43610 +°381. 
@) T= 1698+ -221R + 4522 + 392. 
(wv) T= 1748 + 192L4 4986 + °398. 
() T= -202K+ 0508+ 145R +3387 +376. 


(w’) T= 217K + 0478 + (113L4+3°728 + 380. 
64—2 
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With this large number of formulae to choose from there should be no difficulty 
in getting a very close approximation to the lengths of these two non-measurable 
diameters. Since the desire to attain an end with the least possible exertion is 
a weakness of human nature, the majority of those who favour these formulae 
with a trial will, no doubt, wish to use a single measurement, and especially one 
from which both diameters can be found. Such measurements are the antero- 
posterior diameter in (b’) and (m’), and the height of the hip bone, right in (9’) 
and (p’), left in (h’) and (q’). They must not, however, expect such good results 
as the man who measures the diagonal conjugate and the inter-cristal diameter 
to obtain the obstetric conjugate from (a) or, more approximately, 


C= 98D — 1°68, 
and the transverse diameter from (/) or, more approximately, 
T=31K+471. 


With a little extra labour the height of the hip bone can be added to the list. 
If H represent the height of either hip bone in the living subject equations (7’) 
and (7’) may be combined to give approximately 


C=D- ‘05H -1, 
or C=(D—1)— 5H. 
Similarly (7’) and (s’) may be thrown into the combined form 
T=1(K+10)+ 4, (H+10). 
We thus arrive at the following simple rules : 


(i) To find the obstetric conjugate subtract one-twentieth of the height of either 
hip bone from the diagonal conjugate diminished by one centimetre. 


(11) To find the transverse diameter add one-quarter of the inter-cristal diameter 
increased by ten centimetres to one-tenth of the height of either hip bone increased by 
ten centimetres. 


(7) On the application of the formulae to the pelves of dwarfs. 


It is important to know what reliance can be placed on these formulae in 
cases of extreme pelvic deformity. In an article on “ Dwarfism” by Rischbieth 
and Barrington (Treasury of Human Inheritance, Vol. 1, p. 355, 1912) several 
measurements of dwarfs of various types are recorded. In nine cases measurements 
of the pelvis are given which include the diagonal conjugate, but in only four of 
these is the obstetric conjugate also given. The obstetric conjugate has been 
calculated from the diagonal conjugate for these nine pelves by means of the 
equation 


C='98D + 1°68 
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with the following results: 


Number | Type of case | D C (observed) | © (calculated) 
1 Achondroplasia | 7 — 518 
2 95 s) — 7:14 
3 Ateliosis 9°21 7°62 7°35 
4 ” | 8°26 7°62 6°41 
5 . 9°95 — 8:07 
6 ss 8:5 om 6°65 
4 Rickety dwarf 7 6°25 518 
8 5 rs 8°75 = 6°90 
gy Uncertain type 8°47 7°62 6°62 


In No. 3 the calculated and measured values are very close, the former being 
slightly the less, but in 4,7 and 9 the calculated value is about 1 cm. less than 
the observed value. In 4 the height of the hip bone was given, so that we were 
able to find the obstetric conjugate from the equation 


C=(D-1)-yH. 


The value obtained, 6°57, is a little nearer that of the observed value, 7°62, than 
is the value obtained from the diagonal conjugate alone, 6°41. 


The last column shows that all the pelves are of such a degree of contraction 
as would call for special treatment during pregnancy or labour. The deformity 
would in most cases need no calculation to make it obvious. Even in extreme 
contraction, however, the formulae may be useful. In considering the advisability 
of Caesarian section in such cases, it is necessary to know whether the pelvis will 
admit the passage of even a dead child. This can be readily ascertained by 
applying the formulae. The tendency seems to be for them to give too iow 
values in these extreme cases. With the present-day results of Caesarian section 
this is to err on the safe side for both the mother and the child. 


SUMMARY. 


1. Methods for obtaining the height of the hip bone on the living subject 
and for measuring the thickness of the tissues over different parts of this bone 
on the corpse, have been described. 


2. The averages and variability of the pelvic measurements have been found 
and the various measurements correlated. The obstetric and diagonal conju- 
gates show very high correlation. The inter-cristal is the diameter most closely 
correlated with the transverse diameter. Both the obstetric conjugate and the 
transverse diameter show good correlation with the height and breadth of the 
hip bone. Corresponding measurements on the two hip bones are more highly 
correlated than two measurements on the same bone. 
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3. Formulae have been obtained for finding the obstetric conjugate and the 
transverse diameter on the dry pelvis. The former may be found best from the 
diagonal conjugate, singly or combined with the antero-posterior diameter, 
the height of the hip bone, or the pubic height. The transverse diameter is 
found from the inter-cristal diameter, taken by itself or combined with one or 
two of the following: inter-spinous diameter, antero-posterior diameter, height 
of hip bone. The formulae were tested and found to give good results for the 
modern English female pelvis. ° 


4. Pelves were measured dry and after immersion in water to study the 
effects of moisture. All the diameters and the heights of the hip bones are 
slightly greater in the moist than in the dry pelvis. 


5. The way in which the soft tissues affect the measurements has been 
pointed out. The thickness of the tissues over the bones bounding the internal 
diameters is almost negligible. 


6. Finally formulae have been arrived at for finding the obstetric conjugate 
and the transverse diameter in the living subject. The simplest of these, for the 
obstetric conjugate, are approximately 


C= ‘98D — 168, 
C=(D-1)-ahH, 
and, for the transverse diameter, 
T= 31K +4471, 
T=1(K+10)+7,(H+10), 


where D is the length of the diagonal conjugate, K that of the inter-cristal 
diameter, and H the height of either hip bone. 


In conclusion I wish to thank Prof. Karl Pearson and all those who have so 
generously come to my help—Prof. Symington for placing his diagrams at my 
disposal, Prof. Thane and Dr Derry for the loan of pelves, Dr Derry and Dr 
Braxton Hicks for their measurements on post-mortem subjects, and Mr Soper 
for the useful little instrument with which most of these measurements were 
made. To Prof. Pearson is due whatever merit this work may possess. It has 
been carried out under his guidance without which it could never have been 
attempted. 


APPENDIX I. 


Measurements of five pelves used for testing formulae. I and II ancient 
Egyptian female. IIT ancient Egyptian male. IV modern English female. 
V modern English male. In IV and V the bones were connected together by 
the ligaments. 
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TABLE A. 
2 oe ai 
Measurement I II | Ill IV 
Inter-crests 22°30 24°92 | 28°28 26°26 | 26°54 
Inter-spines 19°81 21°25 | 27°75 21°40: | 23°29 | 
Transverse MWE33) ITEGi | L2e bo: 12°85 INE6Sa 
Diagonal conjugate 11°18 10°75 12°68 14°14 12°70 
Obstetric conjugate 10°71 845) 10°57 12°42 10°32 
Antero-posterior 10°42 | 11:03 10°88 10°11 10°71 
Pubic height 2°32 3°88 4:47 3°72 4°20 
Right hip—height 17°40 17°98 | 20°55 19°08 | 21°55 
Pe breadth 14:29 | 14°32 | 15-41 15°78 | 15°87 
Left hip—height 17°60 17:87 | 20°62 | 18°79 | 21-70 
5) breadth 14:02 14:49 | 15°80 | 15°77 15°87 
Sacrum—height 8°72 9°35 | 10°93 — = 
breadth 10°30 10°85 | 11°83 — — 
| | 


subjects to show the thickness of the soft parts. 


Table of measurements taken by Dr Derry from 20 adult female post-mortem 


TABLE B. 
Thickness | Thickness | Thickness | 
of tissues Ischio-iliac of tissues | of tissues | 
Inter- in this diameter above crest over 
Age| cristal diameter of ilium ischium Remarks 
diameter 
Rei R L Rell Mowat wee |) ae 
| 
61 31°15 *65 | “79 | 19°95 | 20-00 45) 55°50 | °55 | not very fat but well nourished | 
85 | 26:35 | 48 | -49|19-75| 19°60 | °55| -60| -50 | -40 _ z a 
18 | 28-50 | °45 | -40 | 20-00] 19-90 | -39| -40| -60 | -46 i _ - 
56 29-00 *55 | *50 | 23:00 | 23:00 *80 | 80) °50 | °45 mA - a, 
57 28°50 *55 | -60 | 23°20 | 23°55° | 1°00 | °85] -55 | -45 | stout 
54 | 25°00 | 40 | *40 | 20°30 | 20°35 05 | 45 | °50 | °50 | slight layer of fat only 
55 | 98:95 | -35 | 40 |21-15| 21-65 | 50/50] 50 | -45/ | i . 
62 32°70 *30 | °30 | 22°95 | 23°00 "40 | -45| °50 | -70 | thin 
Bil 26°95 *20 | :10 | 20°50 | 20°40 30°20) 40 | “45 | very emaciated 
65 28°50 *20 | °35 | 21-00 | 21°20 *30 | -45 | -25 | 30 | thin 
28 28°75 *20 | *20 | 21:20 | 21°40 “35 | *35-| 40 | *30 emaciated 
59 30°30 *35 | °35 | 23-00 | 23°00 *3d:| 35:| °40) | +50 | thin 
20 28°50 *30 | °30 | 21°70 | 21:90 “60°50 60 | °55 | thin layer of fat 
55 27°30 “40 | *50 | 22°00 | 22°50 *50 | 55} -60 | -65 | fat 
39 29°10 B80 | -25 | 21:05 | 21°35 ‘45 | -40} °50 | °45 | thin 
| 28 30°30 *35 || 35 | 23°35 || 22°40 35 | 40) 40 | °30 | fairly fat 
| 38 30°25 *20 | °B0 | 22°45 | 22°85* 40 | 45] °55 * | thin 
45 27°10 NO} |) 220) 12230) 29°25 ‘45 | -40| -60 | -60 | 3 
22 26°10 20 | *15 | 21°25 | 21°35 ores2ou 260) |) Dp es 
43 29°30 *20 | :20 | 20°75 | 20°75 ‘40°40 -40 | -40 - 


* Some swelling over ischium, 


APPENDIX IL. 
CORRELATION TABLES OF THE DIAMETERS WITH ONE ANOTHER. 


Transverse. 


TABLE II. 


Inter-cristal. 


TABLE I. 


The Measurements of the Pelvis 


mt Hd HB 29D 


19 
—GP. DAN 

ett! 1111988 itis l, 

e 1010 1D 

So OM Male er | + 

| bs OC Oe oe NN 

4 10 1010 1 

| —GG-&§T | | Hat ankan | | ~ 

| oD 
aes ae 

—GL ETI OAMROMADDOHH a 

al ae ns 

2 ir a i = 

—G3.8T 1 19 109 10 12 A 


wd 1d 


| IO I~ 


LOMA dA 


1d Ad) 1d 


| 
| 


Lt 
FESEREN 


we Ne) 
a Io Qi Ni~ I~ nn ~ 
—G6- hi oY ee ea les] 
OC ee | rt 
1910 19.20 Ye) 
—G!.97 fe) OI ANN oO 
AacdG B i 
Sis 1 Hig il nN 
Yen} Yen} Ne) 

—G6-96 10 GI od ow 
ne 
—GL-G2 a 
ise) 
ken) 
—G6-G6 
ie 
le) 
GIVE Aw MMAR fo) 
(0) 
c ad ud Yes) 
—Gb-16 +HOoOmMm |” Ne) 
= 
1020) 10120 es) 
~ oO ANG : 
—GL EB N N N N le) 
—AN MNO COO eo 

Ne} a) 10 210 
— GG. lel INS NAGA a 

Solo 

be 1 1d Oe, 

— Gh 66 Se oe 
ea ba ° 
se) 
j=) 
al 


-* ‘aqyeSuuoo ouqeysqg 


TABLE III. 


Nn 
3 wis wus 
6 AMrAAONDMDADMAHON | © 
an AN HO AG 
Gk 96 Pareles) ey | 
6 a. ! wD 
— 96.96 Pleas lie 
10 
—GheGE | eee ta | 
; we (Soho 
—Gb-GE | | S22 NN | | 
Anna 
10.10 
—G..t¢ DID AN | 
oe NN ANOOM 
10 210 
—Gb. 4% rr J 32 | 
aoman fol tal 
1929 19210 
—G).E% ro) daAN I~ WN | 
ee AOROOHNAAA 
a 1 1d 
—G7.e7 12 19 19 he) AN 
ERG SOnMDOAMAS es 
rors) 1d 1d 
Hi Bo ell coe seoseeae rete 


1019 
[ac | A 10219 i 
Ss ONMHONHN 


Inter-spinous. 


a MA 119 
a 2 : 
Gk Tb SSeS | | 
ss ra Ye) | 
S616] | lSadanaan | 
1929 1919 
=2i08| 112, ae ee 
opal || 1 
46-06 ATHAON 
10 
—¢.-.6r | | | alee alia 


UO os Eiken oe en 


UD 1D 15 19. 1H 19 19 1 1H 


NE RVRHVEVHS 


NB 
mRODMAADSDON 
™ =n 


Totals 


‘ayesnfuod 9114048qO 


a aa 


Antero-posterior. . 


TABLE V. 


Diagonal conjugate. 


TABLE IV. 


D. H. dE Souza 509 


—G6-41 = 
10 ener = = 
—GL-ET : 
3 | | | le ee 
10 
—S6-&T P | 
| | =| 
en) —— — = = 
—GL-6T co | | | 
a i | Ue! eo 
Yen} . | | : : 
= : , = ° 
G6-61 M2 GO HONARa a a s 
ne | 
1D 1D 12 10 fe) || —— | |aAlalta + 
SLIT 4 HOMOM AN 4 D s fo ee See 
— — SS = a = Oy | AAAS ay 
aS 2 191919 2 S om aes 
GIT) Cal aaa = = - _ ~ aay _ 7 
eee: Se = oO j 19 19 19 pe) 
a5 = 5 
1019 19 1010.18 2 | La AAttAHN RAR = 
ANr~-ONR ee 2 i : 
ee — jae | 1D Ney 1D 1 
: es | log Yous Yon S tao en Cy I len) |= foal ine) 
a aa 
| - 10 20 102 1D 
| —96- | J Hom+eona (= oD 
. | cl I Yen) 
eH = 
aa 101019 18 
| -—GL-9 | |omoonmwam se | a ~ 
con = = 
4 iors 10 | a 
c 9 i lo 
faa) Ue Om nwo eA He ae N 
= =) 


| oP | | me | | x 
8 S6.1T al aA Ne} 
a 
2 
i Ormt | aN lon) 

s |— 94-01 | | | | a 
= — — $$ 
7 = 

Pe 1010 BP 10 SH 19 10 | Lal 1~ 
2 |—9-0T | | 23 
vm 
fc = = — = 
SH 


10 1d pe) 1D 
OO FO DOr N10 4 ie 
ee 


1 | 1D Ad 


G6-6 | lmowmwnamtan ! a 
wd 10 1 

CL.8 re Te | 

Amato AMAR ODWOAHH 


lame nioaa 


WD IQ TF IQ 
WN X co 


TABLE VI. 


1 UD 19 19. 19 19. 1 ID WH IG 
NA RNKNKKRKVK 
RROHAASOHN 

Ny mn 


Totals 


‘ayesn tod o1130938qQ, 


‘oyesn(uos o11}038q0 


Biometrika 1x 65 


Transverse. 


TABLE IX. 


Obstetric conjugate. 


WADE 


TABLE 


510 The Measurements of the Pelvis 


2 Ney 10 
—G6.4T i erat me ra 
11d Ne} Pa 
—$L-ET eee een a 
Yen} pe} 10 1d 
—GG-ET mts é or 2 
ral a 
Oe 9-97 Pe 
GL-ET ne oe oe 
ere’ a ee) ey 19 
> 2a fe —G6.9% a lee 3 
wd ud 
- Nex Ne} 
AAA 10 P= B~ 19 Ww =~ c ic 
—$6-6T SaaS oS Ghe Ga | on a 
ae N 
= is x) 19 
9 6 —9- 96) |al lad © 
TGhITL A | mOomNtHO eo 
= ey Ne) 
? 948] | laws a 
1010 
—GB.16 |S aang, nN 
10 10 
=H) (ary H c ee 
Gh GG | £109 4 & 
= Hone oD 
—-G6-&6 4 a 
=p PD | xt 
ALASCO 10 COOH AN 
Gh GE | A M10 oH | SSH x 
| oD nN Yo) 19 
os 19 ie S | 96-66 aM an © 
+ | —GS6-G6 02 I~ 0019 | aA | 
Z | & ecm 
ol | a ee I oe as ae i —C Y O° 
S) ite) et GLTEP A | AOI le 
an (oy te > Ques ip) o 
g AAacle I~ MOr~n 2 ra 10 10 S 
[=] Ss 5 | —G6-1TG | nN AKO | onl 
Ll wd ad te) 
—G6-16 ee 1D 10 1910 19 
ee rl AMA10 4 4 onl —GL.06 HON A | a 
aes a = ies 
GLEE THMoS!O FS —95.00" | | ne | Jo 
9 —GL6Tf 7 | | Nea | apo 
—Ge.6T| | aa | — Sa 
— 9-87 | aes 2 
S s ae zs 
H 
—GB-81 Pri Poet ° 
pe) = 
5A) 
See Petit) lille 
fe) 
al & i | 98.2 eee 
nN a i GGT 5 5 
oO a L 
o] a | ai ra | | else po Sae wists 
UW WW ID ID ID IH & q arte Lal foal 
Si Re x 3 LAIRG S =) 
& SHHRRHOS | A faa) (Si aes 4 
30) 1d 19 19 19 15 19 1912. 
Ree = SSEILISLS | s 
yYoroy oIqud SHHRRSHS ITE 
ee) 
qYSoy oqng 


yySioy orqng 


Antero-posterior. 


TABLE XIII. 


Diagonal conjugate. 


TABLE XII. 


Soros |, {il | ts 
Sosaas laaawa | bo 
88 B 
way | |_ Ss 1) |e 
Peas 12 
—-8I| | laaann!l |e 
eo nN 
9 1% 
—GA-TT eitees'oo | | ioe) 
a io) 
a: 
—G.IL| aantoora| do 
ce Pi 
= Ss Paes 
—GL0T| a | aAmoane ~ 
a “H 
19 1019 9 
—G3-01]} | |+uad am 
oH NA 
wg ad 
Gd 6 | Wesrcaieics BY 
—G3.6 | | } (eer H a 
meee S| | | l= | q 
n 
es 1 10 
S A OO HH a 
i= NO Om = 
Gee || i a Io 
Som | la. | 129 
—oL-6T) | | | aa i 
2 9 
—G6-8T] | | moto ron 
re 
1D Yen) 
—=@ihet|| | tHitaei= © 
ol nN 
—G96.2T | |Hmana 1D 
rat } 
Sa: 
—6LIT a lonat+no [Ho 
ee H 


lee 
eRERERE! 


SHUHRRHADS 


” 
2 


“qystTey orqng 


Posterior sagittal. 


TABLE XV. 


Inter-tuberal. 


TABLE XIV. 


D. H. pdE Souza 


5 
e 


36°5 


20°5 


9 


| | 
DD WW 19 19.19 
NRVLVLV 


HHRRoOS 


Totals 


—-qyroy orqng 


Inter-cristal. 


TABLE XVI. 


511 


oA 
e 


34 


© 
a 


24°5 


30°5 


16°5 


16°5 


LD UD WD 1D 19 19 19 1H UC 
SKrVMRerVWrganvs 
S MARRY > 
sy AAAS ANS 


‘OSLOASUBAT 


65—2 


Or 
bo 


Inter-spinous. 


TABLE XVIL 


The Measurements of the Pelvis 


wd ad 
arr is: || - 
wd Ad 10 
—G).e AN . 
GL-ET hy Ress | = 
—G6-61 ARS aa uD) 
|e Ot oo oe | ad 
Yen) 
Sk L | le | Se S 
x 
Ss c v 9 
a | 46-6F | | | loc man Jo 
aS) N 
g a : 
en} Yen} 
| —9- IT | | |se eS ercomimines 
° a) 
o 
= % Sones S 
< SOIT] A, | truco ees 
= 
oosiate wd 
— fet or eS si for) 
4-01 | lacaeowd = 
b iy ; we : : Yen} New) 10 
Be 96-01 a ee SS 
le) 1D 210 10 
| 94-615 | eS cole 
aa u 
<< 
<a 
; 9 
d 
2 & 
ao 
= 19 
rr) 
5 5 
wd 
| —96-6T & me 
2 AEG FDO ADoOON TS 
iS 
@ | ens] | _ SREE EES 
ml hy 
las HAO QoON | a 
Nolo) 
NW 10 es) 
IT] | lon Se00d | 
o 
= AMMAN Ne) 
aoe GLO | a 
an Sg [-#orl-lileetaile 
LD 1D 4D 19 19 19 19 1919 3 Ne = 
Bae a ge See Ess 6 iS seat | | | 
| Se SS | = P< 
= = 
OSTOASUBLT, ZA 
ee RORRERRREG F 
E VEC IL ILS |S 
SORnnRRovnos i= 
AANA NANNY 


‘OSTOASUBLT, 


Posterior sagittal. 


TABLE XX]. 


Inter-tuberal. 


TABLE XX. 


Noe e) 


home) 


| 1 1d I~ 
1 HOD HON 


awd Ad 
ols ey | He) 


ao | loSoanH 


12 19 
Lal | to a to 
ee 


wd 1d 
J Hatin ma no 
oa! 


Ja lraaae | | 


ANN 


| 
Ko) 


oS 
GLE | ha amin 
2 al 
Sarre 
—¢z.0T| | a Jonas rs 
1919 ie 
—GL-6 1 MOH Hiro 1S 
uw 
10 10 i 
—G6-6 | | ice fis 39 
Ww 1 Ye) Yon 
—GL-8 | | wooo n is 
ra 
9 0 
—G6-.8 J alaonmoua } 
AN 


alee ea bee 
SKELKL ILS |S 
SAARVQRHNHD Ss — 
ANNAN ANN 
“OSIOASUBLT, 


Diagonal conjugate. 


TABLE XXII. 


Antero-posterior. 


XXIII. 


TABLE 


Souza 


—G6-4T | Pee Las TS | aes 

—¢L.81 Oe es Sele ees 

7 es aes 10 

—G6.81 Vee ile Sacer al ese saad rl es os 

; ; - : nnn) 1919 Ne) 

—G6L-6E = | SGeaomeeqa ace 2 

ee Ss 

—— me, } Dod ) 

46-61 | DAH 6 HOM A a 

eee 20!” 3 

—¢).IT | Ney NN re) a 

sit H'AHMNONKRHH4AAN Cae Wc 

10.19 

——GZ.IT 1D 10 19 19 19 191 B= N10 | | S 

ae ANH oo + HAAS + 
1 AD a wd 1d 10> one ie) a = 

—GL OL ee a eS ron 

= =f 

_ re) ad iz le) 

— GeO Vos | | 2 Ske Slee re | to] + 

ye NANHOANANS na 


| 1 1 109 10 


HORA AR GS 


1 10 
Ne) ~ nN | 


ai 1010 ey 
—GL.61 | | 1029 10109 GI 10 0 16 a 
nN ONMBANOOA HsHOnn ol 
| ey 1d ad 9 Yo) 1d 
—G6-6T | | | ee ee 8 
ao | | | hee wD 1D 1.0 Te) 1D = 
ah TE HlaAnNntoOdnOH4M4R + 
1D 19 19 Ney 

G6 | = 1D 4 10 = 
ae. OE | | | nMen ved on ow +o g 
= (69) 1) UCD poe) Re} 
40F | oo a HH MMADHAAS na 


6 


‘snourds-10y Uy 


514 


TABLE XXV. 


TABLE XXIV. 


Posterior sagittal. 


Inter-tuberal. 


The Measurements of the Pelvis 


CG Te dl aleslin tle = 8 al Fed = 
=o Or fill al lie Bec et Pal | Jo 
—$3.01 i | | | Lt le I | 4 ee 
=9/.6 eal tah lth Tas] Ib | RS 
eC S| (el = | Pee | | |~ 
—G1.9 | | [fC ce eat | tates +] = 
ah 1 ee Res) 19 
—GB.8 ll laHnoalamoaacue | | g 
1910 192 = 19 1 
sere iy, | | ces) [HHOH+NOHOHOR MA | | 3 
Bia Oo 9 99 9 oo) [a 
O° A ! FANN HME HOOK HHNOO eS) 
ie F 
—GLQ = | | | HHAMDOMMOKNA | | | | | x 
= 
= 12 ie) D 
G6-9 PRS PASTS) | | A 
No) 


Rey 

Ll 

a 

19 

° 

Ne) 

—G6-6T nm 

—GLTT = 

lo) wa 

—G¢-1T Jee eee ee eee are es 

met ww 1d O19 

meee Oi Re he ee ie = 

. 1919.18 9 I= 

—G6-0T SR es 16S OID At | oe 
100 

—GL-6 SAAMI HAG | fo) 

pol wd 

eae 109 2 DD 29 

—~~S6+6 les ME eae SSeS Ge) 

1 1D 1910 2 

—GL8 | [Hota amonkwaanda | | SS 

2 1910 9 

—G6-8 ale larnaanaanm | an: g 
)2 2.2.0 

HOA ron 

| as arta a A ee ane 

RIS GSK PLP GEIS SKIL SEIS |Z | 

SESS SSSSSRIV II SITS SS JE | 


‘snoutds-104UT 


TABLE XXVI. 


lL. 


-crista 


Inter 


D. H. dE Souza 


ad ad 


AOooceontaonm 


wd 


wo 
aos 


9°5 
6 


1°5 


—GL-06 


‘snoutds-190qUy] 


66.16 19 | 1030  E= nq 
i on aes | 4 
1d 10 Nee te) pe) 
6) Eley ES Rt Ole xD 
oe chia me ANN OYA N 
| 1D 1 19 1 209 199 
—o7.9¢ 1D OATS eAN oe) 
AOSAG Onnn el HO = 
wd ad 1d) i Non) 
==) C2 ws H 1d Nt wo1~ NN a 
ae oO 4ARM+H+OHNOS ~ 
ad) ud Yo} 
~ On He) ~OA 110 | : 
—_G6-G6 : oh ys art | =) 
= OMAmMTONWM HO nN 
F , a a en) @ Yen) 
O10 0 a : 
—G) te ets 5 
she kG Aight ow oH a 
aq Ad 1d Ye) 
| ‘ 1d 1d 5 10 
IAS . 
—-G).¢7 be Nes Rs 
SIP HAMAMDONH = 
: 10 10 
“ ~ oA GA 
—aIOMe OD H he | 
come HSS = se 
ie) len) i 
Gh BE | HAA eee | oS) 
\ | 1 
— 96-668 | | ° 
1 10 I 
GL TO a5 seey “hth fe or) 
—9e.18 | | | li | © 
1 
a | tail ae 


Antero-posterior. 


TABLE XXVIL. 


516 The Measurements of the Pelvis 


TABLE XXVIII. 


GE Neal ghee eal Sel lal tl ie 
3 ‘ i —G8-1T emitter i hl = 
—=<GLigT | | Ise erat ee eset ia fo 
a Seay fi —GL-O1 Pome ft i | i tl le 
—¢e-er | | rae hails ili lecapera lal aes 
- : eo l(t 
—GL6T | An | AAA NMMAH © | 
a aes |=} j-we | 11 
wm 2¢ u — 
—or.7 AN 1. 10 eee acs 
ead aboaunnas!!|R| £| 26 [pili aici emn einem ie 
Ee : —2 arco Be rs on |— = ee 
SAMI TE ey) aa at = a AnrwnAaN ANR re 
aise ‘Hom DOHOO oD a a G48 ea | | | | = 
2 = ale J 
—Gé.IT 10 Laaa S " —GZ.8 | | oR | | = 
ae = O14 8 99.00 Ha = £ eis ae TMA ANTO a 
TS ae B 2 a 
Bun Deets ro) Ay =e} | oa ai | a 
GLOL al 'ASOBEBOWONAT RQ | lah 4 N OD 7 OD mM MoOoOM+tH ~H 
mE (889 © WOR ci alwicee ge © ef 6 
| —G6-01] o an laaotona la | x S G6: 4 alee oF lea domme maar 29 
Ascaeh = = = | = = 
Se £ I | gues, [es lat |e 
| GLb = i en lees r= a TSO ale tia !tortooo!sA tH 
| : ; [ete z Gene 10 19 1010 
| _4@-6 | Sl alee Aiea a ee ala) |) ee Ave) | ee |ammasueonn | lo 
| ee : << x 
; = 
Fee re (ae VO a Pe ico ate Pa | cea 


Diagonal conjugate. 


wa) 
a 
8 HOMODMOGOHHODANOH Ke) 
| ° AMNAMAN AS pe 
Sa a 
12 
o) 
a 
1 
nS) 
| 10210 Yen) ad) co 
| —G6-&T | a eee SiC fo 
ene = : | —98-11 Cae ee 
: IO sea !OnnAs We) 
oa | es (Je | 8] a 
SSE tex le a OHAN = 
Chin NL PHANNA+HMCO q | 1 
a @ | —¢L-01 trol | Alea | |e 
= = = = | te) Ceol Lame ina) HAN RMA Ae co 
eS aS ac) 10 = a 
—G6-6T nlanaroninann | 1D + —G¢.0I | | 1D 191019 | ne 
oe) w ee Rl HAAN tHtaanman |x Gr) 
= o 
10 10 = ID 10 10 10 awd 1d 
—¢L.IT GSU onmne ed eat | 5 | | ri op cy st coveshetca een Mes 
: 4 NOMA eHMHotM+H = }- homes e > 25 
_ a Ne) Ne) 10 19 
—cCr oD 
ia 0 5 | 2 S6-6 ieee bas | || 
—¢8-1T Pe | Se oe S | 5 Z 
iy Tet SOONO Seas Sou a 7 mis ws 
i e i GL. Leo oe eh IN = 
GL-01 ie || eS a a lao one Oe oD 
Jar a le AOKHOO fe NN bd = 
me i, ; 7 is ae en co ae || 10 210 Nea} 
—G6 ee Be ea com) ee 6-8 lrelieearet! | Nie 3 
96-01] arom | Pd Ses = AIHA ATM AHOS a 
——GUEG, 4 o aaa | | | We} fea} [o>] 
fa 
a 


LD 1D 49.19 19 19 29 19 19 ASSP Ld Le 
~ Ne Ne Xo 2» NR Nz fo Kere 


SUNRR a 1 tS DD RwRH OM 
RNRNAN 3 NNMNNNNNNNNYN 


Totals 


"|B YSTIO-19 UT 


"[BYSI10-19qUT 


Inter-tuberal. 


TABLE XXXII. 


Antero-posterior. 


TABLE XXXI. 


D. H. DE Souza 


=9)-272 || Use ee N= 
Somme | hes ll PLS 
lo) iO 

= (HE || 1 SN Ve eee ed | 
c iO 

eo lellada lowe !o ! | le 
eon (| |e Tt lta 
—GB.0T | CD OD Didi. SYN Ps 
polo) ino 

— G16 AAO AL OOM la | g 
—9E6 OM He [7 28 
pone) oD 

—GL8 Anatrooon per cx 
c 10 

—G6:8 | Ha tisio a [4 lS | < 


Se 


> he) oD 


wo 
oon 
AtHtwMA SG 


OR a0 A 


Ne) 
peel 
NAN 


ie 


Ne} te) 
—GL-6T | Hantaaaan | 
ec 
okie} New) Ne) 
—G6-6T | J HaoR MD ese Ee 
e) 
He) 1D 10d 10 
Gir) | | H4woawoonn D 
at wd ad) ad wdudnd | Oo 
6 sees Se eS 
GEIT] oy |= eae cera =H 
wd ad ie) ie) 
Q ae 5 . 
—SL-0L Mam aconon | a a 
a =H 
10 10 1d Ad 10 10 
—_—G6-0T Bui tina coca | st 
1d add wad Ne) 
—G6-6 a 
GLB q 
ih a Ll leat a 
= ¢ 9 
QVPELVRVNKagnge 2 
AMDBOONHNRAQHNnvss i= 
a ee ee el 


‘ayesnfuos [euosercy 


Biometrika 1x 


Posterior sagittal. 


TABLE XXXIIL 


517 


is) 


pe) wD 1D AO 


DOOGCA QE ADIN 
S i i a Bl 


—GL-6 Hae ° 

HA | NAAN = 

aoe | = 

ey) 10 

—G6-8 alamannannm |oruds 

a 

Yen) Nea} Yen) en} 

—Ghek Ge PNaeeeeocoNS Ces. 23 

a Ja | eomtamna oD 

—G6-h ao Ney 
re} wD ad 1 

—GL-9 a “HH 29 +H 6010 | ~ 

1D 19 
—G&G-9 Nakiowt+9 A Q 
CO 


WW ID 1H IO 
ra NeNVS 
SSUHRNHH SS 
Aa ha he a Ee Be | 


Totals 


‘oyesnfuoo peuosercy 


66 


XXXV. 


TABLE 


Inter-tuberal. 


XXXIV. 


TABLE 


Posterior sagittal. 


518 


The Measurements of the Pelvis 


seo: bE 


—GkL OT 


aac 


19 19 
aA HNO N 


wd 1 1d le) 


COnmMmoOMo 


pen) 


wd 1d 
Gig © HH OO 
aH 


pes} 


1319 
CHAT AS aH | 
Sere 


wd aw 10 1d 


Noi el eye te) 


jnmacone | | 
a4 


[ol Se Ge) Ge) tox) tex) |- 
taal i 


1d Ad 
—6L IT Me ee alee dP) |. 
aa . . 
Orn Ue 
Natite) 10 
~ fen ie) 
—¢2.IT DOAN eae 
Onn} ae te} 
—— A oN fon 
| —¢L-01 a = 
i] 
a, 19 10 19 10 
—G6-0T siemen te eS 
—GL6 FAH HOOHOHNOO 2 
este nd 
10 10 
—eC7.R as o 
| =1@20 lncowoon or) 
. 19.26 Ke) 1 
| —GL-8 | Mia naomnd & 
Yen) 
—¢r. Le 19 : 
46-8 SE | Pe 3 


| Jonoanaa | | 


“L0119}s0d-0.103UW 


Inter-tuberal. 


TABLE XXXVI. 


M10) Mm OD 


W210 AO D tH CO 


37 | 19 | 6-5 11-5 | 1:5 


G6-01 | | 
am 
—GL-6 Nes 
—er oD 
G6-6 oD 
ie) 
—GL-8 i 
ise) 
XS 
—G6-8 —) 
AN 
(or) 


‘Teq es IOTI9}SOq 


Transverse. 


TABLE XXXVIII. 


APPENDIX III. 
TABLES OF THE DIAMETERS WITH THE BONE MEASUREMENTS. 


Obstetric conjugate. 


CORRELATION 
TABLE XXXVII. 


D. H. DE Souza 519 


al | 
S Onna HN 


i 
| | HA sisi oc | 


19 19 19 us 
—G6-8T | |oawonnhon |a i 
1 8 
—GFL-61T SOI Obs an of | or) 
aan co r= 
' Ne) | ‘ - = = — 
—G6-6I a | tirony | Q | 
a = Z © 
mal 
Nola) fon} fn] 
SLIT aa: | R 


lll] faaal | pes 
ie} ley 
ee Cai | lawoanman|] fo 
re 
g 9 eS 
Bp a | AnoHoOWS | | iS 
i=. = 10 
Oo | —96-6 | | Hot n sma 10 
5 oD 
3 av oD 
SE) ae Ae ANKODDmM | 4 
° = tH 
i) = - 
3 | S 1 
| Go: | FOAOONA | g 
| ef +p) 10 
SAM ONOns ed || = 3 GuKoS cote Ne} 
N 
O10 10.10 F aa 
—¢9e.16} | | | oie | bd | 
—_— 
je te) 
G01] | | oe een | 8 S 
=e | | Ifese nzise we a - 
6-01 CO 09 19 19 6 OD Weve oD rz] 
oO 219 — 4 
—GL6 | loom DOH AH S) faa) 
a = — 
9 io | FF 
—G6-6 | | loo magne 4 | SH 
a . 
Ne) Ne) . —— a 
Pee es mm ie qyBroy—dry Sry 
[ol N 
~~ 
a 
oD 
ie 
ee 
a 
BZ 
3S 
~~ 
i) 
= 
=e. at 


66—2 


Inter-cristal. 


TABLE XL. 


520 The Measurements of the Pelvis 


6 


eo) 


2:5 | 12 


AHONNAHAN 


Ye) 
FHHOKOMA | | 


eee el 
Bes 


“10 19 10 10 Ne) 
| |aoaanaa | | 


34°5 | 18 | 28-5 | 1 


oon | 
me H10 I~ 


24°5 


21919 
DD 1D OD OD OD 


3 | 165 | 16°5 | 30°5 


| 
SRE 
lon) 
2 


(>) 


‘aysroy—diy yysry 


Inter-spinous. 


TABLE XLI. 


| et eaconetesnee 


—G6-66 


1 1D pe) 
AARNONS 


—G&TE 


0 | 3 4 | 8 10 | 135 15 | 36 16-5 | 24 23°5 | 28°5 | 12 


1 


0 


| 
Re 


1°5 | 0°5 


Totals 


‘qystoy—adiy yysry 


521 


D. H. dE Souza 


s[eqO T, 
I — —G6-66 
0 = —GL-18 
| be are I e=niie 28 G61 
| ieee beet a! I I F I | GL-08 
Gc Cul. peels Ol! Rap Te) GE-00 | 
6G — id cole Pe eo 29 ealee | 1-67 | & 
ice eee ee eal cPy cm OR Beek CSP oa A coer lee hee 216 ag 
i | —— eal eal tier € L Te | Rey OTS taal | aie | ail GL-8I | SB: 
cer po |) es | eee Oe) Alle | age 8 
61 | | @|e)] ole |—| 3] {—seer] | 
L ce ce) ae tt bal Se | tee Al ae ae" 
z : | t pax —$L-9T | ga" 
0 all eee = —G5.91 | = 
© GU.GT | 7 
he — 46-91 
STeIOL 
‘aqesnfuoo jeuoseig ‘ATTX ATAVL 
Pe — = , : a a 
| 114-66 |G-LE} 6L | 9-62] 63 19-9] T | @ | 912 GT] STPIOL 
| 1 eS eT —G6-6E 
eee) Hae eo : | ore —GL TB 
T| @ Tees L ae Nie Meee) Glee Gada Tec Ni |) Teed ee | eel ee Mie ore, 
Te eee eee Ale | el eee See Ee Gre Gee ST |e ee oe 
€| ¢ |¢8 | OL] oz 6s |—|—|2|2i\¢orie¢] gi. e| & 19-1 |s-0 | —|—|—]—8.08 
=} 98 G9) 60 eer Gal Weel OG hast ee ie cel ea dsl Cn b {1a call ieee Peat (ew 2 40504 
Te ee CO MRM Gsr yl) wedi aT lee e | e-eGie Gy lO.) 99 ley) y | (= |e lacaren 
BOn) ee) 6 x) Ole O° \'ap| Sf | al eal py | | Ty Sh GG VG-2T.| 9 |) | GB. |sGxT GOW e Ler 
GIO GG ail eo (AOL | 8On Gee Osea): ee) OT A Oey Na re le ac One GisGreia) 195 i 7. TL |) a | | carey. 
— || ie WG Ot) Te Oe ie ea aoe r | O-8 j Ge el 1h fanaa 
SS te oe I Fm ing 0) == a Vai A) Vem cml ne |= i eae el Peet 70 76 
a) SS Se WS Pca Rican Pe ber aah cee lle | : = |W iah en Veseesc Prme| (acaek 00) § 
=| 0 = SS e098 
Ee ae Sea ns |e i | eee ee 
oe | a eal alee el | To 96-07 
Bl o& me mw i aS SaaS oe ee eee ae Se eee aS 
Soe a SO I as SA | SE | ES Se Gs ee ies 
S aL Rl SLR} AlL SPS] Spgs; Rl Sl S/R RlPal S| al} Rj als als 
| | | | | ea Peet rsa ho a eo eae Pl aa | | | Pepe 
esroasuely, “TITTX ATAVL ‘aqesnfuos o11q038qO TTX WIAVL 


‘qys1oy—diq yoy 


Inter-cristal. 


TABLE XLV. 


Or 
bo 
bo 


6 


| |p roar ctinctes 


12°5 


le) 1D 1d 
| ee eauGa etteoi mates 


es) 
eA O10 OOr 


28°5 | 12°5 


laoOman 


18 


1D 19 19 19 
Ber ees 


5 | 34°5 


| 24 


30°5 


16°5 | 16°5 


Yen) 
lee salees 


3 


ANAAR | | 


Lier lail setae al 


Flee al 


eRe eres 
SOSSSSSYYRYLYY 
Win SHAR BDMDABDGONHNAR 
SAAR AAA HRAVQVQNVN 


‘qysroy—adiy yory 


Inter-spinous. 


TABLE XLVI. 


The Measurements of the Pelvis 


1 


AnnAnAe | 


2 y wD 
|| 2.5 


975 |6°5} 1:5 | 1°5 


12 191020 
aamoua lea 


a OD 
aA NOTOM 
al 


Ho haATA 


ad Ad Ne) 
lan#ond 


ee a} 
m1 © 0 6610 HH 


poe) 2S) 
ence) |) coe! | 


12 1D Non} 
Aats lao = 


lonnanaa4 


|otaae | 


| | kvouaist sl 


Peliiteitsree || | 


Onno 3 | 4 | 8 |10 / 135 15 136 |165|24 | 23-5 | 98-5 | 12 


aul 
| | | et 
LW LD 1D LW AD 19 OH 
LERLRS IK GL SL IS 
WHWODKRDDAHDOONHYN 
AANA AANANRRANR 


Totals 


‘qysroy—dr 


| 
a 

® 
4 


Left hip—breadth. 


TABLE XLVIII. 


Right hip—breadth. 


TABLE XLVII. 


D. H. DE Souza 523 


aes 
—GL-91 Lesiiaileatle || | Fea |e . 
s —G6-8T 4 
—G6-9T emirates esl SVE iS eee te aaa 
en) Noma < —— 
9h GT | | | | ream ea ot | | s —Gz.8I 1029 1910.18 Lo) 
- ® shee! SCOranataa a 
Se | | eee 19 i g |— = = tees 
G6-GT lSEqmwwoed (i, ict 3 ; 1 19 
2 —2 | —GheIEY |} 19 OOS R 
ple) 
= l : a —— a 
GATT | | HwtooOode | m~ Pe 5 | =e we 
a 2 5 —G6-TT HOR DHOOD 8 | 
» vu | 
—GE.TT = DOaMm HH | | 13s oD | 
y iP we a || 
= a | SLOT AAP 19 a4 = 
= > Boles x a = 
Gh-ET 5 Die eae weal | a 
; 0 2 18 oS ie: . | 96-05 S 
—G6-&T AAA OMWONOR | | eS pee a 
New} 
10 10.10 1 Fy | —&L-6 ca 
—GLéT| | lan lane | | | | =s - = 
qt -Q 
= a —G6-6 ro) 
te) New) s 
—G6-6T | | eel ele i) id | roo) = 
meopaelal dt | bol | kl bebe 2 
= R 1D 10 1D 1D Yo} 
2S AMR AaHONDDMOAtHOR = 
SGie| ||| a lrele | Pale each 2 Ve a AAtnm aaa S 
SOIC |\ A)" |) SPs el ee a Vi 
iZ) 
= 1019 1019 S 
ae) ADrAHONDDMONHOF | & 
© AN HOMO G a 
& 
10 
Ne} 
re 
S) 
. Gd OE G19 99 HOD 99.19 S 
aS - _ — 
== Ye) 
o0 aA 2S ONS | , 
3 G6-0T | ele Se Sareea | | Be 
ai 
" | 2 Fs ey 10.10 1 
GL-6 i 
C gS le HID OOOKAS HY =H 
pel re | = eb wd id awd ad 
—G6-FT |r adon Sion | Ole 8 Wee Sul esicsisece lt eee | a 
ely ea) 
1D 19 Ke} 1919 Ned —G/.9 ua exbr's) Dehra ost t| 29 
4 AAO 1 > 
Gh EE COAOSHMMADN cs at 
a | “A740 Ne) ve a 
4 1010 © 0-5 | enereorenes =| 
—S6-8T) ta nootanm rs) be 
: 10 
19 1919 TGhek 2 | nN 
(Ranresre carc4, | ae HAAN nN 
v4 ml fae | 
— |- 
— 
Sat 
| 
lay 
mo | 
a D UD UD 1D 1H 1 19 16 14 & LS 4 
=! | eas oe ce 
REHHAASCSHNRAHAHD i= 
ie a | 


; ; -oyeSnft09 911409840) 
‘agesnluos o113048q, 


on 


breadth. 


Left hip 


TABLE LII. 


breadth. 


Right hip 


TABLE LI. 


The Measurements of the Pelvis 


esi ie 
oe ay ME) Fs) 
25 2 | 
Onanna da : 
a oa 
—— — 
2 22 lic AS 
Noronm ina sy 
a 
de) 
cl 
i} 
Les! 
xu 
= 
a 
= 
i= 
| 
ical 
4 
a 
Se 
D 
3 a aS © 
>) NAHORRRRH+r J a 
= ANRMA a 
—=GL-60 | | lok le lieielani pcs 
—=G6-6T | I) etoue S ics 
GL-T] | | | [tense 
he) Yon) 
—G6.TT |] a SF is Ss cS 
wo ey 
GL OF Attar aa g 
1019 
aye ep ey SI 2 
+ G6-O1 HOnmM.oOr +t mal 
a >] 
an pan 
a as 
~ Gar oontan | 
| i) xt 
= Zz 10 
5 —G6-6 ator od fo 
Si ees é. a 
1010 
— “46-8 Hana = 
10 1010 S 
Toke ke a | NRA te nN 
4 10 10 
Goa) | Ore (ae <H! | 
= a eae | 
el GL.9 oe 12 | 
GL9 hae eee I | 
— (=) So | 
4 | 
& | 
— 
ire ee 2 
VK Ie ISesgrs |S iS 
SSHHRAHDSHS a 
SAAN NNN 


“OSHOASUBL T, OSLOASUBIT, 


APPENDIX IV. 
CORRELATION TABLES OF THE BONE MEASUREMENTS WITH ONE ANOTHER. 


TABLE LV. 


Left hip—breadth. 


TABLE LVI. 


D. H. dE Souza 


eon | || Paes Re lealieellers 
ele-9E NW | ee WAY [ees aces 
2 = 
—GheGT | | | | eee sees oe iy 
—G6.4T} | [pareve Tee CeO sca 
t 1 2, 419 
Gin || AA OHM Le os 
—GE.TT 4 ee ‘3 
129 Pete ibe 
—G).ET IDDOHON H 

| 

accor! |i | ae R 


mi) 0 120 rm 


96-81 | PR ON Ti eae Ot eile 
_ | —8L-8r Tie belelilise = pl taleaity ees 
a=} |= 
S | cee | = e 
oe en MOAKoOKMee FA 
E Fae ee oe 
| mee oye a | Howto on | 
5 = a 
=) ; 10 1d 
8 | —96-IT MAS AM MAA | om 
a lee iene eo) 
py - 

—Gh-O1 Sg Sp Oe g 
i = ee 
beat. | le) Ne) 
= —G6-01 aM HOtHAA © 
< =o x 

e * a> 
a | —-GL.6 3 
= piaekasl 
pO 
< 
HH 


ce 
tH 
[Sea Sei) 
S Ne) 
96-97 ll Aalaaastoan | 2 
a a 
op _ ue 
Saas | | maoowon 62 


re} 


Right hip—breadth. 


alt Alba 


WD WW I6 6 Io 
RRWRWKRWR ReILIe 
MSORARHHHASOHYH 
AAAAANAAANNRRNKVR 


WwW IDV ww WwW ah 


a] & 


‘qystoy—dry yy sry 


Biometrika 1x 


height. 


Left hip 


TABLE LVII. 


wD 1D 1D 19 Wd Lp LQ WD 1H UW wD Ip 
ay WA LN VKVKVL VK 
MDSSRAMDDAASSHN 
NAAN NNN RRRVQN 


‘qysroy—drq_ qyouy 


67 


526 The Measurements of the Pelvis 


TABLE LIX. 


Sacrum—height. 


| 


height. 
W ~VWW WwW 
nr if i ie NAN 


on 


— 


| 
Oe. Pe 


Right hip 
 ®& BW WB HHH 
| 
Doers) Sie Se | 


BME OOOT MOA BMHNNVNADQAN 


| 


eS 
| Wists etre | 
— 
or 


Reb j He bo bo 


RWW WW? 
MAMAN 

| wo 
Dre NWR BRNO We 
site || OCo | peor 


| 
ral 
| 


Totals 


TABLE LX. 
Left hip—height. 


al 
RN 
oo 
~ 


Right hip—breadth. 


ia’ 
2 Sy 
PRFES) BRITS REINS) ENG LG) BRIS) RSLS 
AMAAAAAAAAS 


TABLE LXII. Sacrum—breadth. 


breadth. 


Left hip 


TABLE LXI. 


H. dE Souza 


—G6-8T 
—GhL-6T 
Ne} 
—GE.6 | : 
96-61) | Seo | 
MONOCOTS 
Se ILEE | ons 


527 


1 O | 3°5 | 26°5 | 49 | 635 43 | 27°5| 1 1 


pe) 
AOA HOO 


1D 12 19 


La Nee | 


Pee 
At rh 410 SS 


23°5 | 30°5 | 16°5 


wad 


momromoten | 
ee 


MHAOON [= 
al 


36 | 44°5 


& | --96-01 
fel) 

— 

o 

ai —Gh6 
a 

53 | —96-6 
ie 

(2) 

i} 

HN) | —GL8 


9.19 
NAOH Hm 


| 


23 
2¢ 


a 
aA 


| ai 


ll 


19 | 
foal OM tH A La! 


12°5 


Jaa 


4 


en en} 
Id I~ D2 
Lal ak? OR) 
al 


TABLE LXIILI. 


0 | 0:5} 11:5 | 28 


yiprorq—digq 4ysty 


2 
=WIDWIMO BSW WMH 6616 [ES 
MeRNKRNQVKRNWVWKRNVLRQVN = 
TA VR VN 9} 8H WSO 19 © i= 
i he a el 


‘yipeorq—dry ySry 


67—2 


528 The Measurements of the Pelvis 


—S6. ET im 
—GL-6T Lew 
; i - Say | Ne! Ne) 
| —96-6T | |acmomkaa la pw 
= See 
cS Nel 1D ; 
i —GheTT Jancoaodaa | OF 
soe ee 2 ae 
S| —S96-TT oD gs HACONKADHHAROrOR | © 
5 | Ne} 5 AMDHMAN 
5 Es = 
8 By ee 
oa) —GL-GE 4 | | | | le) se ease ees 
= = E ae 
al a —$6-81 emer | fb sos 
4 —oITP || |) laljalaa|a| | fo 
ea 
] a 109 x 
(aa) — G6. TT | | | latHanman | | Co) 
<q | — 
a \ 19 10 
m —GLe OT | | |Hanwoontaa la g 
& HAOONEFRADAHARBOrOH J © 
5 SO HOAN a aS 12 ne) 
= S | 796-01 | |eamdwowe | | | % 
2 2 6 
jl —G6L.6 (Seiten |= | + 
ol fl xt 
q |— a! 
= 5 —G3.6 ure er se oe [le <A 
—GL GT | HaoaAan 3 
NM —GL.8 ae OM~- MANA a6 
wD 1D 
—G6-GT ies t a Altaan aa 
Ye) wD Lie) 
—O Ff, | g : a s 
ie Ghai TT An Or-aArnnan re Gee =1 || loumate | | Q 
Ae 
351 | io | 4 
S | — 96-71 1 > Sy 
is é 0 
—GL8T a 4 ro) 
= | —¢¢.81 R} i a 
1 | 
3 Sal 
I | 92.87 Se = 
[| 
G6-6T B 
‘qystoy—diy yor] 
= | —Sh-11 = 
‘= 
bd | 
& ros) 
x) | 
pa] 
aa 
<< | Te Te ae Tala ele | | 2 
7 BERS RLE oe Eyes ta 
1D 1D DH RR O LDHAASSHHR es 
SAAN NNNNNNRRRAYV 


yysroy—diy yor 


Sacrum—breadth. 


TABLE LXVIII. 


Sacrum—height. 


D. H. DE Souza 529 


GGT IT SS eV ise (its 
ecco ht i) tL ) ee ep aS 
ie) 
aye 12 eT | 
Se-Cm EI IW Scanner ascy | | 
1D 10 1910 
=Qp ie! al | |+oane os | o 
=H 
wd Ne) 
=o |) || || J a+r inwoaoN oD 
aad Ne) 
econ Lesa es ll 1 PS | 
a 
No) 
a 
12 
——GLeTE iran! ; Ao | (oe) 
we i 
—_S6-1T lamas | | fo 
rt 
fl we he} 
S401 latoato | apo 
= an 
A 1 1D Yen} 
Sl OWE Otn OWE aA) 
ase) * 
Ate <a = ene 
OD 1D 1d 1d Key 
oe) i eGo | odo S a + 
se la 20 en 
S 5 —G6-6 | Joma 2 
a S | | eno hot oD 
Q NM pass ol 
ae ; ae ; 2 —G7.8 NAHMHAN o 
ate ton a eo ao aa 
10 19 eG Lane, ol LWaaae a 
Ea aa \ 10 ey 
G46 9 OW DI~ cot 4 = ——————— — 
ah oy | 10.19 
at x G64 oonrn =H 
ae ay OO HOMO RSS Ne} A = _ ees (eee 
G66 _ oD bd 
- = Vecolae| 
29 ~ 
So 4 
& 
ra | | 2 
faa) Yoni Confit Confit Confit Con ih Con ih Con onl Conlon) Et 
< RNR RARE RoR | + 
= DE Se ee ec [ul 
“Up peaIg— UNG 
| 


TABLE LXVII. 


‘yrpeatq—dry yor] 


MISCELLANEA. 


I. The Statistical Study of Dietaries. 


By KARL PEARSON, F.R.S8. 


THERE is not the least doubt that a proper statistical study of diet would be of very great 
value. Such a study is of first class importance when we turn to the accurate discussion of 
working-class budgets. A statistical study of dietaries should consist of two parts. In the 
first section an analysis should be made of the food consumed and of the physiological value 
of its constituents for, say, a hundred working-class families, noting their income, occupation, 
rent, size of house and other details. In the next place an anthropometric, and where possible 
medical survey should be made of the families, their numbers, ages, statures, weights and 
general healths should be recorded, special pathological or disease conditions noted, and some 
record made of the habits of house-mother and father. Only when this has been fully, 
accurately and extensively enough done will it be possible to draw conclusions of real scientific 
value, and to give advice to the working-classes as to the best expenditure of income on foods. 


A recent Report upon a Study of the Diet of the Labouring Classes in the City of Glasgow by 
Miss Dorothy E. Lindsay, issued by the authority of the Corporation and with a preface by 
Professor D. Noel Paton, M.D., has been widely reviewed, and various suggestions made therein 
have been cited as demonstrated facts without any critical examination of the data upon 
which they are based. Thus we have been told that a return to the national dish of porridge 
and milk is an urgent need of the Scottish working-classes, and Miss Lindsay’s data have been 
cited as showing the inadequacy of the dietary of families with regular wages under 20s. per 
week. Miss Lindsay obtained dietaries of 60 working-class Glasgow families, and these dietaries 
show a very large amount of work and a considerable fulness of record. To the extent of 
60 families she has certainly fulfilled the first requisite of an adequate enquiry as to the 
effectiveness of diet. Her families contained 400 individuals, of whom 246 were under 16 years 
of age. If we are to judge of the relative value of the various dietaries, this can only be done 
by discussing their effects on the 400 persons who partook of them. In order to obtain real 
knowledge from Miss Lindsay’s results, this population ought to have been weighed and 
measured. Without this the dietaries seem to us of exceedingly small worth. Actually Miss 
Lindsay did ascertain the height and weight of a few children, put together in Appendix IV. a 
Physical Condition of Children. : 


This contains the heights of 10 girls and 7 boys, and the weights of 20 girls and 16 boys. 
These children are scattered over all ages from 5 to 13 and over the whole of the Groups A to L 
into which Miss Lindsay divides her families. If we only class her groups into three there is not 
even the weight of one boy and one girl for each year of age upon which to base any measure of 
the value of the dietaries! As an illustration of this let us take the important Group E of 
families with regular wages under 20s. Miss Lindsay, p. 18, provides the dietaries of five such 
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families containing 29 individuals, of whom 19 were children under 16. She writes of this 
Group : “In this section, which embraces those who may properly be called poor, not one diet 
reaches the minimum energy value of 3000 calories” (p. 18). Criticising individual diets she 
says: “In LI. the use of a greater proportion of the cheaper vegetable foods, e.g. potatoes and 
haricot beans, gave a better energy value for the same cost....In XXV. the animal protein is 
considerably in excess of the vegetable protein (see Appendix III.). As the former is more 
expensive, this diet could be improved and a higher protein content obtained by reversing the 
proportion of animal and vegetable protein. In view of the fact that the fat is low, necessarily 
so because of its cost, the carbohydrate intake should have been greater.” 


Dr Chalmers in his recent paper before the Royal Society of Medicine, “The House as 
a Contributory Factor in the Deathrate*,” notes that Miss Lindsay’s observations bear so 
directly on the inadequacy of the dietary of persons in the smaller houses that he quotes her 
sample dietaries in families with regular wages under 20s. per week and concludes with Miss 
Lindsay’s remark that “the children are nearly all small and light in weight.” It is not clear 
whether Miss Lindsay intended her remark to apply to Family X XY. only, or as Dr Chalmers 
supposes to the whole of Group E. Dr Chalmers’ interpretation has been widely taken to 
be the true one although which is the correct interpretation matters little: Mor of the in- 
dividuals in this group not a single boy’s weight ts provided by Miss Lindsay, and of the girls in 
the group only two were measured. These two are certainly below weight; they do not belong 
to Family XXV. but one to Family XVII. who shows developed rickets, and whose father has 
been a heavy drinker, and the other to Family LII. where the father suffers from phthisis and 
is in receipt of parish relief. It would therefore appear that Miss Lindsay’s remark must 
apply to the group in general. But the whole demonstration of the effect of the dietaries 
is thus seen to turn on the weight of two individual girls aged 9 and 5 years respectively, 
one of whom has ricketst, and both of whom have degenerate fathers ! 


In order to test what relation, if any, the income of family has to weight of child, the corre- 
lation coefficients of weight (w) and income (7) for constant age have been found for both girls 
and boys for the whole of the 20 and 16 cases respectively provided by Miss Lindsay. We 
havet: 

Weight of Girls and Income for constant Age 
alm = 03 +15, 

Weight of Boys and Income for constant Age 
al = 22 +'16. 


In neither case is there any significance having regard to the probable error. Whether the 
weight of the child is or is not related to the parental income cannot possibly be settled on the 
data provided by Miss Lindsay, and, whether she possesses it or not, she publishes in her paper 
no material on which it would be possible to tell the effect of the dietaries of families with 
incomes under 20s. on the size and weight of the children. 


But there are a great many other question-begging conclusions in Miss Lindsay’s memoir. 
She starts apparently with the opinion that the physiological construction of the food is of no 
importance, and that a “calorie” whether obtained from peas or eggs, beans or beef is of equal 
value. But surely this is the very sort of question that an investigation of this kind should 
answer, not assume? Miss Lindsay criticises—at times severely—the housewife who spends her 


* Proceedings of the Royal Society of Medicine, Vol. v1. (Section Epidemiology and State Medicine), 
pp. 155—181. 

+ Rickets can hardly be due to lowness of ‘‘calories” for some of the families with the highest 
calories (II. with 4003, and IV. with 3882) and high wages (IV. with 50s. 8d., V. with 39s., XX XIX. 
with 41s.) are rickety. 

+ I have most heartily to thank my colleague Miss Ethel M. Elderton for the values of most of the 
correlation coefficients of this note. 
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money on animal rather than on vegetable food, “Protein from vegetable sources is from one 
and a half to two times as much as that from animal sources. In one or two cases it is nearly 
three times as great, and in these diets the energy value received is remarkably high for 
the money laid out....In some of the studies this proportion of apimal to vegetable protein 
is reversed....All these diets show the disadvantages of this large use of the animal food. The 
energy value is low and the cost is high. How are these diets to be improved? Study XIX. 
(Group G) seems to give the answer. In this family ‘porridge was eaten twice a day.’ The 
energy procured per penny spent was the second highest recorded, 619 calories, while it is noted 
that the children were strong, healthy and well-grown” (p. 28). And again : 


“A porridge and milk diet contains the food principles in correct proportion. The protein- 
rich animal foods, flesh, fish, eggs, etc., are all too expensive for the labouring classes, and any 
increase in their amount in the diet is impracticable. But cheese and the cheap protein-rich 
vegetable foods, oatmeal, peas, beans, etc., should be more freely used” (p. 29). An exami- 
nation of Study XIX. thus commended shows that 5:07 shillings were spent on animal, 8°75 
shillings on vegetable food, or the expenditures were as 1 to 1°72, while in the population at 
large 698°77 shillings were spent on animal, 487:28 shillings on vegetable food, or the ratio was 
1 to 0°72. This abnormal ratio is obtained in the family in question by taking no eggs, hardly 
any butter, and instead of milk, buttermilk, skimmilk and condensed milk; there is further 
no fish; the increased vegetable expenditure is on meal and potatoes. Now it would appear 
that before a statement is made that this is the right course to recommend, it is highly desirable 
to ascertain whether the money spent on vegetable or animal food is the more highly correlated 
with physical fitness in the individual. This can only be done with Miss Lindsay’s material for 
the weights of the 20 girls and the 16 boys. Accordingly for these sparse data the correlations 
between 4 the expenditure per individual on animal food and V the expenditure per individual 
on vegetable food with weight of child for constant age were determined. These gave : 


Girls Boys 
a’ Aw gad) Kear) O7+°17, 
a! Vw eee SO tae 30+°15. 


If any stress could be laid on these results, we should have to conclude that money spent 
on animal food would be best in the case of girls, and money spent on vegetable food in the 
case of boys. But an examination of the probable errors shows that no weight whatever can be 
laid on the results. In fact that Miss Lindsay’s data are wholly insufficient to answer the 
question of whether money spent on vegetable or animal food is the better. In fact if we 
take the correlation between weight w and /’ pence per individual spent on food we find : 

Girls Boys 
tinct | ee Deal 22, +°16, 
both of which are low correlations and neither definitely significant having regard to their 


probable error. 


Next the relation of weight in the children to the number of calories in animal food (C4), 
the number of calories in vegetable food (Cy) and to the total number of calories in both (Cr) 
was investigated. 


The following values were found : 


Girls Boys 
al C yw .- “164°15 ‘DL+°12, 
aT Cyw was C20 a 304-15, 

204-14 wy eave 


al Cpw eee ae 
If these results could be trusted at all, we should have to assert: (i) that the number of 
calories in the food is twice as influential in the case of the boy as in that of the girl, and 
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(ii) that the number of calories in the animal food is more influential than the number of 
calories in the vegetable food for the case of the boy, and the reverse is true for the case of the 
girl*, As a matter of fact the size of the probable errors shows us merely that it is very 
unsafe to draw any conclusions at all. On the basis of the scanty data provided by Miss 
Lindsay for the physical fitness of the individuals subjected to the various dietaries, no legiti- 
mate conclusions whatever can be drawn as to how wages or diet affect the individual ; still less 
can it be asserted that a return to porridge and milk, and an avoidance of the purchase of flesh, 
fish and eggs would save the situation. 

Indeed as far as any stress whatever can be laid on her slender material the conclusion to be 
drawn is entirely the other way, i.e. the greater the consumption of animal food relative to veget- 
able food the heavier will be the child. To test this the ratio of money spent on vegetable food 
(V) to the money spent on animal food (4) per individual was correlated with the weight of the 
child for constant age, ie. the partial correlation 47,,7,4 was found. Further as the money 
spent on food might not have been spent to the best advantage the ratio of the number of 
calories in the vegetable food Cy to the number in the animal food C4 was also correlated with the 
weight for constant age, i.e. the partial correlation coefficient q7¢,)c ,,» Was found. The following 
values were determined : 


Girls Boys 
al V/A, w eee — °32 eta 14 —°95 + 16, 
~ 12415 — 234-16. 


a? Cp/C4,w 

Now the correlations are not very large as compared with their probable errors, but 
they have all one sign, and this is negative. In other words the increase of expenditure 
on vegetable food relative to the expenditure on animal food, or the increase in the number 
of calories obtained from vegetable food as compared with the number obtained from animal 
food is associated in every case with decreased weight of the children. Thus we see that as 
far as any conclusion whatever can be drawn from Miss Lindsay’s data, it is directly opposed 
to her statement that it is better to spend money on oatmeal, peas or beans than on flesh, fish, 
eggs, etc. She has started with the dogma that a calorie is of equal value whatever its origin, 
and not stayed to investigate whether it was even justified by her own material. Meanwhile 
her statement as to oatmeal has gone out as if it were a statistically demonstrated fact, whereas 
the only conclusion which would be justified in any measure by her data is that the proportion 
of animal to vegetable food should be kept as high as possible, if weight be taken as a test— 
although but a partial one—of the efficiency of a diet for a growing child. 

The elaborate dietaries carefully worked out by Miss Lindsay are of very small service 
indeed, because they have not been accompanied by any adequate anthropometric record of the 
families thus dieted. The present reviewer wrote at once to Miss Lindsay and later to Professor 
Noel Paton in the hope that it might still be possible to save the situation by some attempt to 
weigh at least the majority of the members of the families whose diets were recorded. This 
appears, however, to be no longer possiblet, and the result is the not unfamiliar one—an 
elaborate piece of investigation has been carried out and practically no safe conclusions can be 
drawn from it. Those concerned in it have not studied beforehand what would be needful | 
statistically to establish an inference, and the sections of the memoir (pp. 28—32) on “'The 
possibility of Improving these Diets” and on “The Relationship of these Diets to Health ”— 
which have been most widely quoted in the popular press—may express either correct or 
incorrect views; the data provided are wholly insufficient to justify any conclusions at all. 


* A little consideration will show that this does not confute the result reached later, because 
in g?c_w and gro, no attention is paid to the relative amounts of animal and vegetable food. 

+ A final appeal to Dr Chalmers, since this paper was in type, has resulted in his most kindly 
attempting all that was possible under the circumstances. Some 64 children of from 5 to 14 years 
of age have been found and measured, and I hope in the next number of this Journal to revise the 
above numerical constants on this larger, but still sadly slender material. 
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II. Note on the Surface of Constant Association. 


By KARL PEARSON, F.R.S. 


In the recent memoir by Dr Heron and myself a reference was made to the future publication 
of this note on the surface of constant association, Biometrika, Vol. 1x, p. 315. The integral 
equation to the general surface was given, and also the particular form it took, still very com- 
plicated, for the simple case of total or ‘marginal’ frequencies being normal distributions. It was 
considered worth while to fully analyse one case of such a surface, namely that of @=0°6 for 
Gaussian marginal frequencies. The numerical calculations were carried out by Miss Julia 
Bell, M.A., and from her ordinates of the sections the sections were plotted by Mr H. E. Soper, 
M.A. with the aid of a Coradi coordinatograph. By interpolation when needful Mr Soper 
constructed the isoplethes of this surface of constant association, and also an excellent card 
model, for comparison with the model of a normal surface. The chief features of this surface 
were referred to in the paper just cited. Although the marginal frequencies are symmetrical, 
the cross sections are skew-curves, the skewness increasing from zero for the central section 
to ‘16 for the y-array when 2=1-5o, and to ‘20 for the y-array when v=3'5o,. 


Diagram I gives the series of sections on one side of the mid-section e=0 up to 7=3'5c, 
by intervals of 0:25o,; the same sections are repeated in inverse order on the other side of 
the central section v=0. It will be seen at once from the indicated means how skew the 
sections are. 


Diagram IT gives the isoplethes or contour-lines of equal frequency. They are approximately 
but not accurately ellipses with common principal axial directions, but they are very far indeed 
from being simzlar ellipses. In the contour corresponding to z=,%, of its maximum value, the 
major axis is considerably more than twice the minor axis of the oval ; in the contour corre- 
sponding to z=;45, the major axis is very much less than twice the minor ; the ovals tend indeed 
to less and less ellipticity. On this diagram are also plotted the regression lines of means and of 
modes. It will be seen that they tend to become parallel to the axis of x, or the regression tends 
to become zero. There is little doubt, that quite apart from normality of the marginal frequen- 
cies, any symmetrical marginal frequency would lead to like results, i.e. the isoplethes would not 
be similar curves, and the regression would be skew, and tend to asymptote to the horizontal. 
Thus the constancy of association would depend for its application on the existence of material 
in which the variates would be intimately related near their mean values and cease to have any 
relation towards extreme values. Intervening values would exhibit every variety of relation-~ 
ship, from the maximum in the neighbourhood of the means to zero value towards the extremes. 
These properties of the surface explain why @ is not even approximately constant, for those 
numerous surfaces of statistical practice in which the regression is approximately constant, 
ie. linear. 


Plates XXX, I (a) and(b), XX XI, I(c) givephotographs of the actual model surface. In Fig. I (a) 
we see the surface ‘end on’ to one set of cross-sections and we grasp readily the skewness of the 
cross-sections and its increasing value as we pass from the central section. Fig. I (b) indicates 
by the verticals the regression line of means; these verticals have their feet on the regression 
line and the eye sees at once by their closer and closer approximation to each other, the deviation 
from linearity. Fig. I (c) gives a diagonal view of the surface of constant association, and under- 
neath it Fig. II (d) has been placed a model of the Gaussian surface* of constant regression for 
comparison ; the angle of deviation was taken roughly about 60°, to give a surface of correlation 
0-5. Except for the regression line and skew-sections in Fig. I, the eye does not distinguish 
very readily in these photographs of Figs. I and II, the fundamental differences of the two 
surfaces which appear so markedly in the isoplethes of Diagram II. 


* On the Brill system of interlaced sections. 
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Surface of Constant ociation. 


Fie. I ()). Surface of Constant Association. 


Biometrika, Vol. |X, Parts III and IV Plate XXXl 


Fic. I(c). Surface of Constant Association. 


Tic. 11 (d). Brill type model of Gaussian surface. 
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The equation to the surface of @=0°6 with normal marginal frequencies, for a total population 
of 1000, is : 
dx dy 
‘ 3 Bi 
(1 +3 (p+g))? — 48pg}# 
p=$(l+a) for x/oz, 


000? “4.3 (p+q)— pq) 


i 


where 
g=}(L+d) for yoy, 


a having the usual significance attributed to it in Tables of the Probability Integral, e.g. 


Sheppard’s. 


Driacram II, Isoplethes of Surface of Constant Association, Q=0°6. 


a1 ae nee iS oe NS pacer) [aver clk rina T T eT eae T 


Line of Means 
(y on x). 

-| Line of Modes 
(y on x) 


‘2:0 


10 


0 


Scale of x/o,- 


The isoplethes are drawn for each tenth of the maximum value of z. 
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The following short table will be sufficient to indicate the nature of the cross-sections to any- 
one wishing to pursue the matter further. 


z-ordinates of y-arrays for given special values of «/ox. 


Array at a/o,= 


B35 0:223 | 0°332|) 0°388 0°314| 0°165 | 0:059 | 0-015 | 0-003 

3:0 1135 | 1°688 1-970 | 1°589) 0834 | 0:297 | 0-077 | 0-015 

25 4-536] 6°736 7811 | 6:223| 3°230 | 1:145 | 0-297 | 0:059 

sig 2-0 14°881 | 21:381| 24-195 | 18-497 | 9-269 | 3-230 | 0-834 | 0-165 
Ga 1'5 | 38-080) 55:053| 57-904 | 39°890 | 18-497 | 6-223 | 1-589 | 0-314 
= 10 85°096 | 113-670 | 100°499 | 57-904 | 24°195 | 7-811 | 1°970 | 0-388 | 
oo 0-5 4 155°888 | 168-289 | 113-670 | 55-053 | 21°381 | 6-736 | 1-688 | 0-332 
om 0:0 1 198°944 | 155-888] 85-096 | 38-080 | 14:881 | 4°536 | 1-°135 | 0-223 
® | —0:5 | 155°888| 97°562| 49°567 | 22°036 | 8-414 | 2°644 | 0-662 | 0°130 | 
= —1:0 85°096 | 49°567 | 25-217 | 11°369| 4-379 | 1-381 | 0°346 | 0-068 | 
s = 15 38-080 | 22°036| 11:369| 5°186| 2-010 | 0°635 | 0-159 | 0:031 
| _ 20 14°881 8414] 4:379| 2:010| 0:781 | 0:247 | 0-062 | 0-012 
—9°5 4536 2644 1°381 | 0°635 | 0°247 | 0-078 | 0-020 | 0-004 | 
—3:0 1°135 0:662) 0°346) 0:159) 0-062 | 0-020 | 0-005 | 0-001 
~3°5 0°223 0130 =0:068 0-031} 0-012 | 0:004 | 0-001 | 0-000 
Means 0:000} 0-268 0-494 | 0°646 | 0:723 | 0°757 | 0°766 | 0°769 | 


III. 


Studies in the Meaning and Relationships of Birth and Death Rates. I. The Relation- 
ship between “corrected” Death-rates and Life Table Death-rates, by JOHN 
Brown es, M.D., D.Sc. Journal of Hygiene, Vol. x1. No. 2, pp. 178—190. 


There are three “death-rates” used by those who deal with the statistics of public health 
etc.; the “crude” death-rate found by dividing the total number of deaths in a district in 
one year by the total number of inhabitants ; the “corrected” death-rate found by applying the 
death-rates for age and sex to a standard population and calculating the rate from the figures 
so found; and the “life table” death-rate which is the rate that would be found by working out 
a complete stationary population from the death-rates for each age and calculating the ratio of 
the total deaths among the assumed stationary population to the total stationary population. 
This result is simply the reciprocal of the “expectation of life” at birth. Dr Brownlee thinks 
that this last measure is the most satisfactory death-rate and his paper is an attempt to reach 
approximate values for it from the corrected death-rates. Although Dr Brownlee does not set 
them out in that way these approximations appear to be based on the use of an imaginary 
stationary population or populations and require, we think, to be tested more extensively than 
has yet been done before they are used for any practical conclusions. 

It appears to us that the use of the “expectation of life” at birth or of a single death-rate 
for a population as a whole cannot, however it is calculated, gauge completely the mortality 
of that population, and cannot therefore form an entirely satisfactory basis for comparative 
purposes : two populations may show the same death-rate or expectation of life and yet the 
mortality of one of them may, for instance, be heavy only in the first five years of life, while - 
the other is light for the first five years but heavy at later ages. There must therefore be 
limitations in the use of any single figure, and if more than one figure is used we do not see why 
the death-rates at various ages should net be employed as they stand. They are easier to 
interpret and no more difficult to deal with than the expectation of life at a series of ages,-—an 
alternative measure implied by Dr Brownlee. 

In-the paper before us Dr Brownlee mentions that further work is to follow, and possibly it 
will show how he intends to overcome these difticulties. Wottee 1s 
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IV. A Case of Familial Malformation in a Fowl’s Head. 


By HUBERT M. TURNBULL, M.D., Director of the Pathological Institute 
of the London Hospital. 


8.D. 2105, 1912. Fow!’s head received from Mr Roxburgh, 10th October, 1912. 


The breeder says that another chicken of the same brood has a similar deformity. Fowl 
examined turned out to be the heaviest of the whole brood, in spite of its crooked beak. 


External appearance. ead and neck of domestic fowl, ?cock. Wattles are red and 
symmetrical, Flat, tongue-like, red comb. Eyes are of equal size. Above the nostril on the 
left side there is a semilunar cartilage 15 cms. long and 0°5 cm. from above down; nostril is 
21 cms. from the anterior canthus. Right nostril is 13 from anterior canthus; semilunar 
cartilage is 0°5 by 0°3.cm. Nostrils on both sides are 2 cms. from the angle of the mouth. 
Horny beak starts on the left side 2°3 cms. from the angle of the mouth, on the right side 
lcm. from the angle. As a result the superior mandible on the left side is convex and the 
upper beak is twisted over to the right so that its point lies 0°2 cm. to the right of the right 
border of the lower beak, at a point 0°7 cm. from the end of the latter. The upper jaw and 
the beak on the right side form a sharp concavity. The anterior border of the orbit on the 
left side appears mnuch farther back than on the right. Tongue is quite symmetrical. On the 
roof of the mouth the palate is divided into three divisions, one side having been cut longi- 
tudinally. The anterior part of the palate corresponding to the beak is not divided. This 
portion is deflected with the beak to the right side. 


The chief alteration in the soft parts is the small size of the right nostril and its 
operculum. <A longitudinal incision had been made, in killing the fowl, through the palate to 
the left of the middle line. 


K 


I. View of skull from III. View of skull from right side. II. View of skull from 
above. Traced from a Traced from a photograph. below. Traced from a 
photograph. photograph. Left pala- 

N=Nasal bones. tal and pterygoid bones 


F=Frontal bones. severed. 
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The head was simmered for five hours and the flesh removed with forceps. Partially cleaned 
in alcohol and chloroform—equal parts. 


Skeleton. The incision in the palate has severed the left palatal and pterygoid bones close 
to their articulation with the vomer and rostrum. The right nostril measures 0°7 cm. long by 
1 cm. high, the left 2 by 0'7 cm. The right frontal anterior to the posterior limit of the orbit is 
3 cms. long and from a breadth posteriorly of 1 cm. tapers gradually ; its anterior extremity 
appears to be of denser structure than that of the left frontal, and is devoid of a process mesial 
to the nasal bone. The left frontal is 2 ems. long and maintains a breadth of about 1 cm. 
This asymmetry throws the anterior part of the skull a little to the left. The superior 
surface of the right nasal bone measures 2°3 by 0°5 cms., the left 2°6 by 0°7 cms. ; the lateral 
process of the right nasal does not share in the hypoplasia. The left nasal bone is thin and 
translucent, the right is thick and opaque. The right jugular and quadrato-jugular measure 
4 cms. as against 4°2 cms. on the left. The right superior maxilla appears to be smaller than 
the left. The shortening of these bones on the right side bends the fore part of the face and 
beak sharply to the right. The other bones are symmetrical. 


The narrow right frontal may be the result of hypoplasia of the right olfactory bulb, in 
consequence of which the soft and bony parts of the right nose may have failed to develop. 
This theory, however, does not give a satisfactory explanation of the elongation of the right 
frontal, the absence of its process mesial to the nasal bone, and the sclerosis of the right 
nasal bone. A more satisfactory explanation of the deformity is that it is the result of a 
hypoplasia of the right nasal bone, the elongation of the right frontal illustrating the inter- 
dependence of growing structures. The sclerosis of the right nasal bone, and apparently also 
of the external frontal process with which it articulates, suggest that the arrest in development 
has been caused by inflammation or trauma. The absence of the mesial process of the frontal 
may be due to involvement of the tissue in which it is formed, in such inflammation or trauma. 


V. Note on a Ready Reckoner for the Obstetrician. 
EDITORIAL. 


On p. 503 of this volume of Biometrika two formulae are given by Dr de Souza, namely: 
(c’) for determining the most probable value of the obstetric conjugate in terms of the known 
values of the diagonal conjugate and of the antero-posterior diameter, and further (2’) giving the 
most probable value of the obstetric conjugate as deduced from the known values of the diagonal 
conjugate and of the height of the right hip-bone. It was proposed to include in Dr de Souza’s 
paper two abacs or ready reckoners for the use of the obstetrician, so that after measuring on 
the living subject the diagonal conjugate and either the antero-posterior diameter or the height 
of the right hip-bone, he could read off from them the probable value of the obstetric conjugate 
without any calculation. These abacs were prepared by Miss H Gertrude Jones, but owing 
to certain difficulties of engraving, it was not found possible to delay the printing off of 
Dr de Souza’s paper till they were completed. The two abacs have only reached us, in but 
moderately successful reproduction just as we go to press, and are included in this final note. 


The method of using them is extremely simple. For example, in the case of Abac I, 
Plate XXXII, suppose the observed antero-posterior diameter to be 11°7 cms. and the diagonal 
conjugate to be 13-4cms. we move upwards between the vertical lines at 11°6 and 11°8 on the 
antero-posterior diameter scale, until we meet the horizontal line at 13-4 cm. on the diagonal 
conjugate scale; we then pass off on the nearest sloping line to the scale of the obstetric 
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conjugate: the required sloping line lies between the 11:4 and 11:5 obstetric conjugate lines 
or the probable value of the obstetric conjugate is 11°45 cms. The actual value found from 
Dr de Souza’s formula (c’) is 11°43 cms. If the vertical antero-posterior diameter line and the 
horizontal diagonal conjugate line intersect in any case below DJ, there‘is possibility of difficulty 
in labour. 


Again, if the diagonal conjugate be 13°4 and the height of the right hip-bone 20:7 cms., we 
enter by the vertical line at 20-7 on the right hip-bone scale of Abac II, Plate XX XIII, and 
the horizontal line on the diagonal conjugate scale at 13°4 and find that they meet on the 
sloping obstetric conjugate line of 11°45 cms. The actual value as found from formula (?’) 
is 11-44. The line of possible difficulty in labour is indicated by DD. The use of a lens 
may occasionally be of assistance, but most obstetricians will find themselves able to read 
the value of the probable obstetric conjugate to at least a millimetre and with a probable 
error of 3 mm., this is ample. Descriptions of the methods of measuring the required lengths 
on the living will be found in Dr de Souza’s paper. 


ADDENDUM. Deformity known as Split-Foot, Biometrika, Vol. 1x. p. 385. 


Annie Eliza 8. (Mrs F.) IV. 9, has given birth a few days ago to a female child, quite normal 


in every respect, October 22, 1918. 


CAMBRIDGE : PRINTED BY JOHN CLAY, M.A. AT THE UNIVERSITY PRESS 
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IV. OBSTETRICIAN’S ABAC I. Plate XXXII. 
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The Inheritance of Ability. By Epcar 
ScuustErR, M.A., D.Sc., Formerly Galton 
Research Fellow, and Erne, M. ELpERTON, 
Galton Scholar. Jssuwed. Price 4s. net. 
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Performance of Manhood. By Epcar 
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of First Cousins. By Eraen M. ELprerton, | 
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Prarson, F.R.S. Issued. Price 3s. 6d. net. 

A First Study of the Inheritance of 
Vision and of the Relative Influence of 
Heredity and Environment on Sight. By 
Amy Barrineton and Karu PEARSON, | 
F.R.S. Jsswed. Price 4s. net. 

Treasury of Human Inheritance 
(Pedigrees of physical, psychical, and patho- 
logical Characters in Man). Parts I and II 
(double part). (Diabetes insipidus, Split- 
Foot, Polydactylism, Brachydactylism, 
Tuberculosis, Deaf-Mutism, and Legal 
Ability.) Zssued. Price 14s. net. 

The Influence of Parental Occupation 
and Home Conditions on the Physique of 
the Offspring. By KEraHen M. Enperron, 
Galton Research Fellow. Shortly. 

The Influence of Unfavourable Home 
Environment and Defective Physique on 
the Intelligence of School Children. By 
Davip Heron, M.A., D.Sc., Formerly Galton - 
Research Fellow. J/ssued. Price 4s. net. 

The Treasury of Human Inheritance 
(Pedigrees of physical, psychical, and patho- 
logical Characters in Man). Part III. 
(Angioneurotic Oedema, Hermaphroditism, 
Deaf-Mutism, Insanity, Commercial Abili- 


ty.) Jssued. Price 6s. net. 


LECTURE SERIES. 


The Scope and Importance to the | 
State of the Science of National Eugenics. | 


By Karu Pearson, F.R.S. Third Edition. 
The Groundwork of Hugenics. By Karn 
Pearson, F.R.S. Second Edition. 
The Relative Strength of Nurture and 
Nature. By Eruen M. ELprerton. 


On the Marriage of First Cousins. By | 


Ere M. ELDERTON. 


The Problem of Practical Hugenics. | 


By Karu Pearson, F.R.S. Second Edition. 


x. 


XI. 


XII. 


XIII. 


XIV. 


XV. 


XVI. 


XVII. 


XVIII. 


The Influence of Parental Alcoholism 
on the Physique and Intelligence of the 
Offspring. By Eruen M. Enprrton, as- 
sisted by Kari Pearson. Jssued. Second 
Edition. Price 4s. net. 

The Treasury of Human Inheritance 
(Pedigrees of physical, psychical, and patho- 
logical Characters in Man).. Part lV. 
(Cleft Palate, Hare-Lip, Deaf-Mutism, and 
Congenital Cataract.) Jsswed. Price 10s. 
net. 

The Treasury of Human Inheritance 
(Pedigrees of physical, psychical, and patho- 
logical Characters in Man). Parts V and 
VI. (Haemophilia.) Jsswed. Price 15s, 
net. 

A Second Study of the Influence of 
Parental Alcoholism on the Physique and 
Intelligence of the Offspring. By Karn 
Prarson, F.R.S., and Erarn M. ELpERTON. 
Issued. Price 4s. net. 

A Preliminary Study of Extreme 
Alcoholism in Adults. By Amy Barrine- 
Ton and Karn Prarson, F.R.S., assisted 
by Davin Hpron, M.A., D.Sc.  Jssued. 
Price 4s. net. 

The Treasury of Human Inheritance. 
Dwarfism, with 49 Plates of Illustrations 
and 8 Plates of Pedigrees. Jsswed. Price 
15s. net. 

The Treasury of Human Inheritance. 
Prefatory matter and indices to Vol. I. 
With Frontispiece Portraits of Sir Francis 
Galton and Ancestry. /ssued. Price 3s. 
net, 

A Second Study of Extreme Alco- 
holism in Adults. With special reference 
to the Home-Office Inebriate Reformatory 
data. By Davin Herron, D.Sc. Lssued. 
Price 4s. net. 

A First Report on the Condition of 
the People from the Standpoint of National 
Eugenics. Shortly. 


Price Is. net each. 


VI. 
VII. 


VIII. 
IX. 


Nature and Nurture, the Problem of 
the Future. By Kart Prarson, F.R.S. 
Second Edition. 

The Academic Aspect of the Science 
of National Eugenics. By Karn Prarson, 
F.R.S. 

Tuberculosis, Heredity and Environ- 
ment. By Karp Pearson, F.R.S. 

Darwinism, Medical Progress and Eu- 
genics. The Cavendish Lecture, 1912. By 
Karu Pearson, F.R.S. 


Prepared in the Biometric Laboratory and obtainable from 
WYMAN AND Sons, Lp. 


Price 9s. 


The English Convict, A Statistical Study. By CHartes Gorine, M.D. 


University of Chicago Press 


THE MECHANISTIC CONCEPTION OF LIFE. Biological 
Essays. By Jacques Logs, Head of the Department of Experi- 
mental Biology, Rockefeller Institute for Medical Research. 


238 pages, 12mo, cloth; 6s. net. 


The achievements of Professor JAcQqUES LoEB in the field of experimental 
biology have made him so widely known as to insure any book of his a large circle 
of readers. His experimental work at the Universities of Chicago and California, 
as well as in his present position, gives this volume an especial significance. 

In this his latest work Professor Loeb presents many of the current problems 
in biology, and discusses the question whether the phenomena of life can be 
explained by physical and chemical laws. He finds it possible to control by 
physical or chemical means not merely the processes of reproduction, but also the 
conduct of animals with reference to environment. 

The New York Medical Journal.— The profession, as well as everyone interested in biology, 


will thank the author and the publishers for collecting these essays and piacing them before the 
reading public.” 


Johns Hopkins Hospital Bulletin.“ All eager to know more of the origin of life will find no 
modern book of its size nearly so instructive or inspiring.” 


AMERICAN PERMIAN VERTEBRATES. By Samuen Wen- 
DELL WILLISTON, Professor of Paleontology in the University of 
Chicago. 


152 pages, 39 plates, 8vo, cloth; 10s. net. 


This work comprises a series of monographic studies with briefer notes and 
descriptions of new or little-known amphibians and reptiles from the Permian 
deposits of Texas and New Mexico. The material upon which these studies are 
based was for the most part collected during recent years by field parties from the 
University of Chicago. The book is offered as a contribution to knowledge on the 
subject of ancient reptiles and amphibians, with such summaries and definitions— 
based chiefly on American forms—as our present knowledge permits. The work 
is illustrated by the author. 


Athenaewm.—‘ The paleontologist will welcome the work as a solid contribution to our 
knowledge of a fauna which is of exceptional interest to the student of evolution, inasmuch as it 


includes forms that help to bridge over some of the differences between reptiles and am- 
i mies 
phibians. 


The publications of the University of Chicago Press, a full catalogue of which will 
be sent on application, are sold in the United Kingdom and the British Colonies by 
THE CAMBRIDGE UNIVERSITY PRESS, Fetter Lane, London, E.C. 


OXFORD UNIVERSITY PRESS 


Sir John Burdon Sanderson. A Memoir. By the late Lavy 
Burpon Sanperson. Completed and edited by his Nephew and Niece. With a 
selection from his papers and addresses. 8vo, with a photogravure frontispiece and 
two other portraits. 10s, 6d. net. 


African Mimetic Butterflies. By H. EvrrincHam. With ten 
coloured plates and a map. Royal 4to. £2. 10s. net. 


Annals of Botany. Edited by I. Baytey Batrour, D. H. Scorv, 
J. B. Farmer, and R. THAxtTER; assisted by other Botanists. Royal 8vo, Morocco 
back, with many plates and illustrations in the text. Subscription price for each 
four parts, 30s., with 1s. 6d. for foreign postage. 


Forestry for Woodmen. By C. O. Hanson.  Llustrated. Crown 


8vo. 5s. net. 


Warming’s Oecology of Plants. English edition by P. Groom 
and J. B. Batrour. Morocco back, 10s. 6d. net; cloth, 8s. 6d. net. 


Knuth’s Handbook of Floral Pollination, based upon Hermann 
Miller’s work, The Fertilization of Flowers by Insects, translated by J. R. 
AinswortH Davis. Vol. I, Morocco back; 21s. net; cloth, 18s. net. Vol. II, 
Morocco back, 35s. net; cloth, 31s. 6d. net. Vol. ILI, Morocco back, 31s. 6d. net ; 
cloth, 28s. net. 


Adler’s Alternating Generations. A Biological Study of Oak 
Galls and Gall Flies. Translated and edited by OC. R. Straron. With coloured 
illustrations of 42 Species. Crown 8vo. 10s. 6d. net. 


London: HUMPHREY MILFORD, Oxford University Press, Amen Corner, E.C. 


Zeitschrift fiir Morphologie und 
Anthropologie. 


Unter Mitwirkung zahlreicher hervorragender Gelehrter herausgegeben 
von Prof. Dr. G. SCHWALBE, Strassburg. 


INHALT DER LETZTERSCHIENENEN HEFTE: 


Bd. XV, Heft 2. J. Sosporra: Der Schidel von La Chapelle-aux-Saints und die Mandibula des 
Homo Heidelbergensis von Mauer. Mit 2 Tafeln und 1 Textfigur—Max Hivznermer: Ein Hundeskelett 
und andere Haustierfunde aus dem 3. oder 4. Jahrh. n. Chr. aus Paulinenaue (Mark). Mit 1 Tafel und 
1 Figur im Text.—A. Masuer: Topographie der Trénendriise der Japaner. Mit 3 Tafeln.—Koronpo 
Hasese: Die Wirbelsdiule der Japaner. Mit 3 Tafeln, 27 Textfiguren und 16 Tabellen.—P. Apuorr: 
Noch einmal die Bolk’sche Hypothese und die Differenzierung des Primatengebisses. Mit 2 Tafeln und 
1 Figur im Text.—V. Grurrripa-Ruccrri: Ueber die endocranischen Furchen der Arteria meningea 
media beim Menschen. Mit 1 Tafel.—Biicherbesprechungen. 

Bd. XV, Heft 3. Micnarnt Reicurr: Untersuchungen iiber die Schadelform der alpenlindischen 
und mongolischen Brachycephalen. I. Zur Charakteristik einiger brachycephaler Schiidelformen. Mit 
26 Textfiguren und 6 Tabellen.—Curt Exze: Zur Anatomie des Gaumenbeines. Mit 1 Figur im Text 
und 15 Figuren auf Taf. XIV.—Koronpo Hasrse: Das quergeteilte Jochbein der Japaner. Mit 
Taf. XV.—Biicherbesprechungen. 

Bd. XVI, Heft 1. Micuarn Retcuer: Untersuchungen iiber die Schadelform der alpenlindischen 
und mongolischen Brachycephalen. II. Vergleich der alpenlindischen brachycephalen Schiidel mit 
den mongoloiden. Mit 11 Tafeln und 19 Textfiguren.—Werner Grazert: Anthropologische Unter- 
suchungen an Herero- und Hottentotten-Kehlkopfen. Mit 2 Textfiguren.—Curist1an Futzer: Rassen- 
anatomische Untersuchungen an 17 Hottentottenképfen. Mit 3 Tafeln und 10 Textfiguren.—N. Gor- 
gatnorr : Die Insula Reili der Esten.—G. Scuwaupe : Ueber O. Abel’s Grundziige der Palaobiologie der 
Wirbeltiere. —Biicherbesprechungen. 

Sonderheft Il. Festschrift zum 70. Geburtstag (am 17. Oktober 1912) von Gustaf Retzius 
gewidmet im Verein mit seinen Mitarbeitern von G. Schwalbe. Mit 1 Portrit, 10 Tafeln und 81 Text- 
figuren. 

Inhalt; G. ScuwauBe: Beitriige zur Kenntnis des menschlichen Magens.—F. WerrprEnreicH: Die 
Lokalisation des Pigmentes und ihre Bedeutung in Ontogenie und Phylogenie der Wirbeltiere.— 
H. Focus: Beitriége zur Entwicklungsgeschichte und vergleichenden Anatomie des Brustschulter- 
apparates der Wirbeltiere. I.—W. Brera: Ueber stummelschwiinzige Katzen und Hunde. . 


EK. SCHWEIZERBARTSCHE VERLAGSBUCHHANDLUNG 
NAGELE & Dr. SPROESSER IN STUTTGART. 


(3) 


WILHELM ENGELMANN 


Verlagsbuchhandlung 
LEIPZIG, Mittelstr. 2 und BERLIN, Neue Wilhelmstr. 8 a. 


Vererbungslehre 


Mit besonderer Beriicksichtigung des Menschen, fiir Studierende, Arzte und 
Ziichter. Von Dr. Ludwig Plate, Professor der Zoologie und Direktor des 
Zoologischen Instituts und des Phyletischen Museums der Universitat Jena. 
Mit 179 Figuren und Stammbéumen im Text und 3 farbigen Tafeln. (Hand- 
biicher der Abstammungslehre Band II.) VIII u. 520 Seiten. Gr. 8. Geheftet 
M. 18.—, Gebunden M. 19.— 


Die antike Tierwelt 


Von Prof. Dr. Otto Keller. Erster Band: Saugetiere. Mit 145 Abbildungen 
im Text und 38 Lichtdrucktafeln. 27 Bogen Gr.8. Geheftet M.10.—. In Leinen 
gebunden M. 11.50. Zweiter Band: Vogel, Reptilien, Fische, Insekten, 
Spinnentiere, TausendfiiBler, Krebstiere, Wiirmer, Weichtiere, Stachelhauter, 
Schlauchtiere. Mit 161 Abbildungen im Text und 2 Lichtdrucktafeln. 39 Bogen 
Gr. 8. Geheftet M.17—. In Lemen gebunden M. 18.50. 


Terminologie 
der Entwicklungsmechanik der Tiere und Pflanzen. 

In Verbindung mit C. Correns, Prof. der Botanik in Miinster, Alfred Fischel, 
Prof. der Anatomie in Prag, E. Kiister, Prof. der Botanik in Bonn, heraus- 
gegeben von Professor Wilhelm Roux. Eine Ergiinzung zu den Worter- 
biichern der Biologie, Zoologie und Medizin sowie zu den Lehr- und Handbiichern 
der Entwicklungsgeschichte, allgemeinen Biologie und Physiologie. 30 Bogen 8. 
In Leinen gebunden M. 10.—. 


Uber kausale und konditionale Weltanschauung 
und deren Stellung zur Entwicklungsmechanik 
von Wilhelm Roux, Prof. a. d. Universitit Halle a. 8. Hine Entgegnungsschrift 


auf die unter gleichem Titel im vorigen Jahre erschienene Schrift des Phy- 
siologen M. Verworn. Geheftet M. 1.50. 


Vorlesungen iiber Vergleichende Tier= und 
Pflanzenkunde 


Zur Einfiihrung fiir Lehrer, Studierende und Freunde der Naturwissenschaften von 
Dr. Adolf Wagner Auferord. Professor an der Universitat Innsbruck. VIII u. 
5188. Gr. 8. Geheftet M. 11.00; in Leinen geb. M. 12.50. 


Einfuhrung in die Vererbungswissenschaft 
In zwanzig Vorlesungen fiir Studierende, Arzte, Ziichter von Dr. Richard Gold- 
schmidt a. o. Professor der Zoologie an der Universitat Miinchen. Mit 161 


Abbildungen im Text. XI und 502 S. Gr. 8. Geheftet M. 11.00; in Leinen 
geb. M. 12.25. 


Vorlesungen tiber vergleichende Anatomie 
von Otto Biitschli, Professor der Zoologie in Heidelberg. In drei Lieferungen. 


Erste Lieferung: Einleitung, vergleichende Anatomie der Protozoen, 


Integument und Skelet der Metazoen. 
Mit den Textfiguren 1-264. VIII, 8S. 1—400. Gr. 8. Geheftet M. 12.—. 


Zweite Lieferung: Allgemeine Korper- und Bewegungsmuskulatur ; 
Elektrische Organe und Nervensystem. 
Mit den Textfiguren 265—451. IV, S. 401—644. Gr.8. Geheftet M. 9.—. 


(4) 


Journal ‘of Anatomy and Physiology 


SIR WILLIAM TURNER, K.C.B. ARTHUR THOMSON, University of Oxford 
ALEX. MACALISTER, University of Cambridge ARTHUR KEITH, Royal College of Surgeons 
ARTHUR ROBINSON, University of Edinburgh 


VOL. XLVIII © 


ANNUAL SUBSCRIPTION 21/- POST FREE 
CONTENTS OF PART I.—OCTOBER 1913 
Jean Murevzyonn, M.B,, Ch.B. On the Innervation of the Nodal Tissue of the. Mammalian Heart. 


Dayip Warersron, M.A., M.D. Reconstruction in Modelling Clay: A Rapid Method of Plastic Recon- 
, struction from Serial Sections. J. Appngron Norrer, B.A., M.D., C.M. Congenital Anomalies of the 


_ | Fifth Lumbar Vertebra and their Consequences. A. G. Timprenu Fisuur, M.B., Ch.B., M.R.0.8., 


L.R.C.P. A Case of Complete Absence of both Internal Carotid Arteries, with a Preliminary Note on 
the Development History of the Stapedial Artery. Recrnanp J. Guapsronz, M.D. Aberd., F.R.C.S. Eng. 
A Case of Congenital Atresia of the Duodenum, accompanied by Volyulus of the Ileum. Huao FiEoxer, 
M.B. Observation upon Cases of Absence of Lacrimal Bones and of Existence of Perilacrimal Ossicles. 
Frepmric Woop Jonus, D.Sc. Some Points in the Nomenclature of the External Genitalia of the 
Female. J.§. B. Sropronp, M.B., Ch.B. A Note on the Significance of Certain Anomalies of the 
Renal and Spermatic Arteries. W. RurtHerrorp. A Swedenborg Mystery: The Rival Skulls. 
T. B. Jonnston, M.B., Ch.B. Extroversion of the Bladder, complicated by the Presence of Intestinal 
Openings on the Surface of the Extroverted Area. ~ ; 


LONDON: CHARLES GRIFFIN AND COMPANY, Lrp., Exeter Street, Strand 


JOURNAL OF THE ROYAL ANTHROPOLOGICAL INSTITUTE 


Vol. XLIII. January—June, 1913 
Contents :— 


Minutes of the Annual General Meeting, January 21st. Presidential Address, Recent Archeological 


_ Discoveries in Mexico. (With Plate I). Dunpss, Hon. Kunneta R. The Wawanga and other Tribes 


of the Elgon District, British Kast Africa. Bennett, F. J., F.G.S., F.R.A.I. Coldrum Monument and 
Exploration, 1910. (With Plate II). Kirn, A.. M.D. Report on the Human Remains found by 
F. J, Bennett, Hsq., F.G.5., in the Central Chamber of a Megalithic Monument at Coldrum, Kent. 
Hoocanrt,'A. M. The Fijian Custom of Tauvu. Hocart, A. M. Fijian Heralds and Envoys. Srannus, 
H.8., M.D. (Lond.), and Davey, J. B., M.D. (Lond.). The Initiation Ceremony for Boys among the 
Yao of Nyasaland. (With Plate III). Houmus, Rev. J. H. A Preliminary Study of the Namau Language, 
Purari Delta, Papua. Brown, A. R. Three Tribes of Western Australia, (With Plate IV). Lys, 
Norman M., M.B., and Joycu, T. A., M.A. Note on a Series of Physical Measurements from Hast 
Africa. Winu1amson, R, W. Some Unrecorded Customs of the Mekeo People of British New Guinea. 
With Plates V, VI). Basspow, Hzrsert, M.D., M.A., B.Sc., F.G.S., ete. Notes on the Natives of 
athurst Island, North Australia, (With Plates VII—XxX). 


WITH TWENTY PLATES AND MANY ILLUSTRATIONS IN THE TEXT. 
PRICH 15s. NET 


: LONDON: THE ROYAL ANTHROPOLOGICAL INSTITUTE, 50, Great Russell Street, W.C. 


or through‘any Bookseller 


MAN 


A MONTHLY RECORD OF ANTHROPOLOGICAL SCIENCE 


Published under the direction of the Royal Anthropological Institute of Great Britain and Ireland. 

Each number of MIAN consists of at least 16 Imp. 8vo. pages, with illustrations in the text 
together with one full-page plate; and includes’ Original Articles, Notes, and Correspondence; Reviews 
and Summaries; Reports of Meetings; and Descriptive Notices of the Acquisitions of Museums and 
Private Collections, : f 

Price, 1s. Monthly or 10s. per Annum prepaid. 


TO BE OBTAINED FROM THE 
‘ROYAL ANTHROPOLOGICAL INSTITUTE, 50, Great Russell Street, W.C. 


AND THROUGH ALL BOOKSELLERS 


CONTENTS — me 
(All Rights reserved) 


I. Anomalies of Pigmentation among Natives of Nyasaland. .A Contribution to the 
Study of Albinism.’ By Hueu Stannus Stannus. (With Plates XII—XXI) 333 


II. On the Expectation of Life in Ancient Rome, and in the Provinces of Hispania 
and Lusitania and Africa. - W. R. esa hear Three aa 
in the text) i 3 3 366 


III. On the Inheritance of the Defosinity known as Split-Foot or Ler Claw. 
(Second Paper.) By Groraz: McMurnan and -Kar- PEARSON. (With 

Plates XXII—XXVII and Two Diagrams in the text) . . Bees ah . 381 
IV. A Study of the Nasal Bridge in the Anthropoid Apes and its Relationship to 
the Nasal Bridge in Man. By Karatren V. Ryiay, Junra Bern and 

: Karu Prarson. (With Eight Diagrams in the text) . ; : aie aos a9) 
V. On the Calculation of Intra-Class and Inter-Class Coefficients of Correlation 
from Class Moments when the number of Possible Combinations is ned 


By J. AntHurR Harris .- . 446 
VI. A Second Study of the Weight, Variability and,Correlation of the Human 
_ Viscera.- By M. GrEENWooD, jun., and J. W. Brown . § Mise . 473 
VII. The Measurements of the Pelvis with special reference to Obstetric Prediction. 
_ By D. H. Di Souza. (With Plates XXVII—XXIX) Mpc” Fe, . 486 
Miscellanea : : 
(i) The Study of Dietaries. By es eee : "de, Sean BOS 
(ii) On the Surface of Constant Association @=0°6. By Kari PEARSON. (With 
lates XXX—XXXI and Two Diagrams in the text) 4 3 3 - . 5384 
(iii) Studies i in the Meaning and Relationships of Birth and Death Rates. Notice ~ 
of a paper by Jonn Brownizn. By W. P. E. . : : : ice NOT 
(iv) .A Case of Familial Malformation in a Fowl’s Head. By Husurr M. TURNBULL. 
(With Three Diagrams in the text) . : , 538 | 
(v) Note on a Ready Reckoner for the Obstetrician. Editorial. win 1 Plates ae 
XXXII-—XXXIT) : 5 : : : virae SEE 8 i iets ws ea 
Addendum ; 3 : Sues : : ake . 540 
pit BeBe eae 


The publication of a paper in Biometrika marks that in the Hditor’s opinion it contains either in 
method or material something of interest to biometricians. But the Editor desires it to be distinctly - 
understood that such publication does not mark assent to the arguments used or to the conclusions 
drawn in the paper. 

Biometrika appears about four times a year. A volume containing about 500 pages, with plates and 
tables, is issued annually. 

Papers for publication and books and offprints for notice should be sent to Professor Karn Pearson, 
University College, London. Ii is very desirable that a copy of all measurements made, not necessarily 
for publication, should accompany each manuscript. In all cases the papers themselves should contain 
not only the calculated constants, but.the distributions from which they have been deduced. Diagrams 
and drawings should be sent in a state suitable for direct photographic reproduction} and if on’ decimal 
paper it should be blue ruled, and the lettering only pencilled. ‘ 

Papers will be accepted in German, French or Italian. In the first case the manuscript should be 
in Roman not German characters. 

Contributors receive 25 copies of their papers’ free. Fifty adciaunal copies may be had on 
payment of 7/- per sheet of eight pages, or part of a sheet of eight pages, with an extra charge for 
Plates; these should be ordered when the final proof is returned. 

The subscription price, payable in advance, is 30s. net per volume (post free); single numbers ~ 
‘10s. net.’ Volumes I, II, III, IV, V, VI, VII, VIII and IX (1902—13) complete, 30s. net per volume, 
Bound in Buckram 34/6 net per volume. Index to Volumes I to V, 2s. net. Subscriptions may be sent 
to C. F, Clay, Cambridge University Press, Fetter Lane, London, E.C., either direct or through any 
bookseller, and: communications respecting advertisements should also be addressed to C. F. Clay. 

Till further notice, new subscribers to Biometrika may obtain Vols. I—IX together for £10 net—or 


bound in Buckram for £12 net. 
te, SHORTLY. 
TABLES FOR STATISTICIANS AND "_ BIOMETRICIANS. 
Epirep sy Karn Pearson. University Press, Cambridge. 


CAMERIDGE;: PRINTED BY JOHN OLAY, M.A, AT THE UNIVERSITY PRESS. 


YL, 


ha ie 
; i 


an 


may 


i Ha ' 

‘ead 
eas 
: iy) 
as 


a ye 


i] 


nit hat ea 


var ae 
nn Bat | 
a 


if 
He 


Peed, 


bo O 
; 
; 


4 i 
Absa 


"WAVNOICIINMEIITON 


3 9088 01230 9860 


